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Summary 

We study the following growth model on a regular d-arj tree. Points at distance n adjacent to 
the existing subtree are added with probabilities proportional to where a < 1 is a positive 
real parameter. The heights of these clusters are shown to increase linearly with their total 
size; this complements known results that show the height increases only logarithmically when 
a > 1. Results are obtained using stochastic monotonicity and regeneration results which may 
be of independent interest. Our motivation comes from two other ways in which the model 
may be viewed: as a problem in first-passage percolation, and as a version of diffusion-limited 
aggregation (DLA), adjusted so that "fingering" occurs. 
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1. Introduction 

Consider the following dynamical method for growing a subtree of the regular d-ary tree B. 
Initially the subtree consists of only the root. Vertices are then added one by one from among 
those neighbouring the current subtree. The choice of which vertices to add is random, with 
vertices in generation n chosen with probabilities proportional to a^"^, where a > is a fixed 
parameter. Let denote the subtree at step n and let h{An) = max{|x| : x G A^} denote 
the maximum height of a vertex in A^. We are interested in the form of the infinite cluster 
Aoo '■= U^o^n; as Well as the behaviour of h{An) and related quantities as n — > oo. In this 
paper we treat the case a < 1; the case a > 1 has already been studied. Our main result, 
contained in Theorem 6.4 and Corollary 6.5, is as follows. 

Theorem 1.1 (Strong Law and CLT). Let B be the regular d-axy tree and < a < 1. There 

exist constants /Uo(a, d) G (0, 1) and cr^ = a^{a, d) > such that 

(a) liuin^oo h{An) = iJ.o{a,d) a.s. 

(b) n~^/^{h{An) — nfio{a,d)) N{0,a^) as n ^ cx) . 

This model has arisen in a number of different contexts, and, as we will see below, can be 
interpreted both as diffusion limited aggregation and as first passage percolation on the tree B. 

The case a — 1 for binary trees arises in binary search algorithms in computer science and 
has been studied by numerous authors including Pittel (1984) and Devroye (1986). The case 
a = 2 (again for d = 2) arises in an entropy estimation procedure of Ziv (1978) and has been 
studied by Pittel (1985) and (along with a > 1) by Aldous and Shields (1988). A discussion 
of the cases a = 1 and a = 2 within the general context of random search trees in computer 
science may be found in Chapters 2 and 6, respectively, of Mahmoud (1992). 

If q; > 1 it is easy to see that A^q :— U^^qA^ is almost surely the entire d-ary tree. Simply 
look at the vacant node xq closest to the root, note that dAn has {d — l)n + 1 points of which 
xo is the most likely to get filled, and use the fact that nfe^n(-'- ~ ii^ ~ + = 0- (This 

argument also works for any tree B of bounded degree.) In the case a > 1 it is therefore natural 
to look at the growth rates of both h{An) and l{An) = min{|a;| : x G A'^}. 

Theorem 1.2. Let B be a binary tree (i.e. d — 2). 

(a) (Pittel (1984), Devroye (1986)) If a = 1, then, writing /?i = 4.311..., /9o = 0.373... for the 
roots of the equation \f3e^^~^'>/^ = 1, 

lim h{An){log2n)~^ = and lim l(An){log2n)~^ = Po a.s. 

n— >oo n— >oo 

(b) (Aldous and Shields (1988)) If a > 1, then 

hm h{An){log2n)~^ = lim l{An){log2n)~^ = 1 a.s. 

n— >oo n— >oo 

The case o; = 2 is also in Pittel (1985, Corollary 1). We remark that Aldous and Shields (1988) 
also study the form of the cluster between l{An) and h{An), and that while they do not state 
explicitly the result for l{An) and a > 1, this is readily derived using their methods. In addition, 
they mention the growth dynamics for a < 1 as an interesting open problem. 
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Comparing Theorems 1.1 and 1.2 we see that there is a dramatic phase transition at o: = 1. 
For a > 1 the process {A^} exhibits the same balanced growth as a deterministic procedure 
in which vertices are added in lexicographic order. Thus the subtree is essentially as short as 
possible, {h{An) ~ log2n) and l{An)/h{An) 1. For a — 1 the results of Pittel and Devroye 
show that fluctuations on a logarithmic scale arise, while for o: < 1 Theorem 1.1(a) implies that 
the subtree is essentially as long as possible {h{An) ~ n). (In this case it is also easy to see that 
Aoo 7^ B - see Theorem 2.1 and Remark 2.2 below). 

It would be of interest to study this "phase transition" at a = 1 more closely. An under- 
standing of the asymptotics of /Uo(a, (i) as a t 1 would be a step in this direction. The bounds 
in (6.9) show that iJ,o{a,d) > ci((i) exp(— A((i)/(1 — o:)), (where \{d) > 0), and we suspect that 

(1.1) < \im.{l — a)logiJ,o{a,d)~^ < oo. 

all 

Our original motivation for studying this model was that it is a version of diffusion-limited 
aggregation (DLA) on a tree. DLA on was introduced by Witten and Sander (1981) to model 
aggregates of a condensing metal vapour, and since then it has been attracted much interest as a 
model for various physical phenomena (see for example Vicsek (1989)). DLA is a Markov chain 
taking values in the space of connected finite subsets of Z*^. Given the current configuration, 
particles diffuse in from infinity according to a random walk conditioned to hit the "boundary" 
of the current cluster and attach themselves to the first point they hit which is adjacent to the 
current cluster. Although the process has quite a simple description, there are very few rigorous 
mathematical results. If A^ is the cluster at step n, Aq = {0} and h{An) = max{|a;| : x e A^}, 
then one hard open problem is to prove the existence, and find the value of 

loghjAn) 
Pd = lim — . 

n— >oo log n 

Kesten (1987, 1990) has shown the limsup of the above ratio is at most 2/{d + l). A conjectured 
value of (3d which agrees quite well with numerical simulations is {d+l){d'^ + (Lawler (1991, 
Sec. 2.6)). There is no rigorous lower bound for Pa aside from the trivial Pd > d~^, and hence 
no rigorous proof of the existence of the "fingering" (Pd > d~^) which simulations suggest. For 
some further surveys of DLA from a mathematical perspective see Lawler (1991) and Barlow 
(1993). 

Analyzing DLA on Z'^ is a hard problem, but the same process on a tree is tractable for 
two reasons: 

(i) There is a simple formula for harmonic measure on the boundary of a cluster (see Lemma 1.3 
below). 

(ii) The absence of loops in the graph means that disjoint parts of the cluster evolve nearly 
independently. 

While the usual heuristic is that a d-ary tree (or Bethe Lattice) will exhibit the limiting 
behaviour for Z'^ a,s d ^ oo, we do not know to what extent our model is relevant to DLA on Z*^. 
However, it may be interesting to note that, even though the harmonic measure of any cluster 
is easy to calculate, the proof of Theorem 1.1(a) is still quite long and hard. 
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To describe more precisely the connection between the model given above and DLA on a 
tree, we need some more notation. We begin by presenting the notation used to describe an 
abstract rooted tree with no leaves. In order to be able to move pieces of trees around, we find 
it convenient to view an arbitrary rooted tree (with countably many vertices) as a subset of the 
universal rooted tree T = UJ^q^"^ where N° = {0} and denotes the root of each tree. (Here 
N is taken not to include zero.) First, the notation for T itself is as follows. If a; e T, |a;| = n 
if and only if x G N"" and x\j = (xi, . . . ,Xj) for j < \x\ (we set a:|0 = 0). If x,y G T, let 
X Q) y = (xi, . . . , x\x\, yi, . . . , y\y\) {0 (B X = x) and x A y = (xi, . . . , Xj) where j is the largest 
integer such that x\j — y\j {x A y — if j — 0). We write x < y if x is a,n ancestor of y, i.e. 
y = X ® z for some z, and we say x is the parent of y if in addition, \y\ = \x\ + 1, and write 
X = par(y). Next, we view an arbitrary ordered (the children of each node come with an order) 
tree as a subtree of T. Given m : T ^ N, inductively define the associated locally finite rooted 
tree (with no leaves) B C T by 

(1.2) B(0) = {0}, M{n+1) = {x®i : X e B(n), z G {1, 2, . . . , m{x)}}, B = U^^QM{n). 

Note that m(a;) is the number of children of x: the condition m{x) > 1 is equivalent to the 
assertion that B has no leaves. If m{x) = d for all x we obtain the regular d-ary tree. The 
values of m{x) for x ^ B are of course irrelevant. Intervals B(j, /c), B(_7, oo), etc. are defined, for 
example, by B(j, k) = Uj<„<fcB(n). 

A subset A of B is a rooted subtree if par(a:) G A for all x E A. S (respectively, 5*0) denotes 
the set of all (respectively, all finite) such subtrees. For A E S, the (external) boundary of A is 

dA= {x e A" : par{x) G A}, 

and the height of A is h{A) = sup{|a;| : x G A}. 

Assume, until we indicate otherwise, that B is the regular d-ary tree. Fix a > 0. Let Q^^ 
denote the law of the random walk [Yq, Yi, . . .) on B started from initial distribution fi, with 
transition probabilities given by p(x, y) = a/{a+d) when y is the parent of x, p{x, y) = l/{a + d) 
when y is a child of x and x is not the root, and p{0,y) = l/d when y is a child of 0. Thus 
(F„,n G Z_|_) is the random walk on B obtained by assigning conductances a~'^ to each edge 
from generation n to generation n + 1. Write t{A) = min{n E Z+ -.Yn e A} for the hitting time 
of yl C B. 

The following lemma describes harmonic measure on the boundary of an arbitrary subtree 

of B. 

Lemma 1.3. Let A E So he a, non-empty rooted subtree of B and fix an N > h{A); let 
Hn = d~^ Ylix&{N) be the uniform measure on B(A'"). If aQ = a Ad then 

(1.3) <5^~(n(SA) = x\r{dA) < 00) = «-!"'( «o ^ ^ dA. 

y£dA 

Proof. Note that \Yn\ is a simple refiecting random walk on Z_(_ which moves to the right with 
probability p = da~^ {1 + da~^)~^ and to the left with probability 1—p. Therefore ify E M{N) 
and < j < iV, then 

Q^y{\Yr,\ = i for some n > 0) = ((1 -p)/p) A 1)^"^' = (cto/rf)^"^'- 
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If T{dA) < oo, X E dA, and |lo| = N > h{A), then IV(a^) = x ii and only if Yq is one of the 
d^~\^\ descendants of x in M{N). Therefore if a; e dA, then 

Q'''' (Yr^dA) ^ x,T{dA) <oo)^ J2 d-^l{x<y)Q^y{T{dA) <oo) 

yeM{N) 

and the result follows. □ 
As the hitting distribution in (1.3) is independent of N we arrive at the following definition. 
Definition. If ^ e S'o and A^ $, then harmonic measure on OA with parameter o; > is given 

by 

yedA 

Strictly speaking, in view of Lemma 1.3, we should restrict to a < d, but allowing o; > d is 
harmless, and in any case in this work we are only interested in a e (0, 1). 

Definition. DLA on B with parameter a > is the /^o-valued Markov chain {An,n e N) such 
that Ai = {0} and A^+i = AnU {D^+i}, where 

(1.4) P{D,,+i=x\A,,)^H^aJx), xedAr,. 

It is clear from (1.4) that (for < a < d) the growth model described at the beginning of 
the Introduction is exactly DLA on B. 

For Q! < 1, Hqj^ favours large |a:| values in dAn, for a > 1 it favours small \x\ values 
and for ct = 1 we have the uniform law on dA^ (the "Eden model"). From the perspective of 
classical DLA the case o: < 1 is of greater interest as it is here that we obtain "fingering", i.e. 
h{An) ^ (log2 n)^ for some /? > 1. This model has been studied in the physics literature - see 
Vannimenus et al. (1984). As well as some calculations in the case o; = 1, this paper describes 
computer simulations which suggested that h{An) ~ c(a)n when d = 2 and a < 1. 

A second motivation is that our model is equivalent to first-passage percolation. In first- 
passage percolation, each edge is assigned a positive random variable, called a passage time, 
and thought of as the time it takes for water (or information, etc.) to pass from one endpoint 
to the other. To each vertex x one associates a time T{x), which is the minimum over paths 
from the root to x of the sum of the transit times. This represents the time before x gets wet 
when the root is a source of water from time onward. The model has been widely studied on 
Z'^, and some quite nontrivial results have been obtained on trees as well (see Bramson (1978) 
and Pemantle and Peres (1994) for two examples). On a tree, it is natural to rescale edges so 
that ones more distant from the root are shorter. For example, the tributaries of a river system 
behave this way; see also the limit trees of Aldous (1991), whose edge lengths decay exponentially 
in the distance from the root. While the edge random variables in first passage percolation may, 
of course, have any distribution on [0, oo), the simplest case is that of exponentials. In this 
case the lack of memory property of the exponential distribution implies that the cluster C{t) 
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of vertices which are wet at time t will be a continuous time Markov process on the space of 
rooted subtrees S. 

In Section 2, we prove that DLA on B arises from first-passage percolation when the passage 
times are independent exponential random variables with mean a"^ (for an edge from M{n) to 
B(n + 1)). Throughout the paper, we will use these various viewpoints interchangeably; thus 
we usually refer to the subtree as a cluster or as the DLA, but keep the notion of passage times 
in the foreground as well, and in fact, most of our analysis takes place in the continuous-time 
setting of first-passage percolation. 

The organization of the paper is as follows. In Section 2 we show that DLA can be embedded 
in first-passage percolation, and give a number of general results on the model. Theorem 2.1 
proves that either A^o — B a.s. or else the cluster has unique infinite line of descent 
(the "backbone"). In the latter case it is possible to decompose Aoo into the backbone plus 
a sequence of finite clusters attached to successive points of the backbone. These clusters are 
i.i.d. given the "percolation times" along the backbone (Theorem 2.5). If a < 1/d, a simple 
law of large numbers argument, based on estimates of cluster sizes in Lemma 2.4, shows that 
limmfn^oon~^ h{An) > c{a, d) > a.s. (Theorem 2.6). This result exhibits our basic approach 
while avoiding the technical problems involved when considering the case when a is close to 1. 
Finally, Theorem 2.8 gives a general strong Markov property. 

The hard work is in Sections 3 and 4. Section 4 contains the key estimates on the sizes 
of the finite clusters, giving bounds in (Theorem 4.4) and in (Theorem 4.6). To handle 
the dependence which arises in these proofs it is necessary along the way to prove stochastic 
monotonicity results for the conditional distribution of the clusters given the backbone times, 
and this is done in Section 3 (Lemmas 3.1 and 3.3). Section 5 shows that the percolation times 
along the backbone form a Markov chain, and that this chain converges exponentially fast to its 
stationary measure (Theorem 5.2 and Corollary 5.7). This paves the way for a Strong Law of 
Large Numbers (Theorem 6.1) and Central Limit Theorem (Theorem 6.2) holding for a general 
class of functionals of the finite clusters. In Section 7 we find a sequence of regeneration times 
for cluster, which allow it to be decomposed into i.i.d. pieces. Combining these results with 
the theorems in Section 6 we complete the proof of Theorem 1.1. Finally, Section 8 adds some 
remarks on the asymptotics of the growth dynamics as o: t 1, and on a related particle system. 

In most of this work we will only be concerned with the regular (i-ary tree, and the DLA 
model described above. However, we may note that most of the results of Sections 2 and 3 hold 
for a more general model. First, we may consider a general locally finite rooted tree B, defined 
by (1.2). Secondly, we can fix a function / : B — > (0,oo), and take the passage time between 
par{x) and x to be exponential with mean f{x). We can then consider a process An which has 
growth probabilities given by 

p{An+i=Anu{x}\An)^f{x)-\ Yl f(yyy'- 

While these extensions involve no new ideas, describing the tree and the process in this 
more general setup does require some quite cumbersome notation. Apart from Theorem 2.1, 
and some simple estimates on cluster size in Lemma 2.4, we will therefore restrict our proofs 
to the case of a regular tree and f{x) — o;'^'. An earlier version of this paper, which treats 
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the general case in Sections 2 and 3, is available by anonymous ftp from ftp.math.ubc.ca 
(directory pub/barlow.) 

A general notational convention is that j denotes a globally defined constant introduced 
in Section i, whereas c, c', c{a), . . . may represent different values in different lines. Dependence 
of Ci,j on parameters such as {a, d) will at times be suppressed if there is no ambiguity. The 
integral of a function (p with respect to a measure /j, (or P) is written yu((/?) (or P(v'))- 

2. The Continuous Time Model for General Trees 

Let B be a general locally finite rooted tree with no leaves and let / : B ^ (0, oo). Consider 
the 5'o- valued Markov chain {A^} such that Ai — {0} and A^+i — An U {Dn+i} where 



(2.1) FiDn+, = x\An) = I fiv)-')-' for x e dA 

[0 for x f dAn- 

Note that f{x) = a'^' on the regular d-arj tree gives the cluster dynamics of the previous 
section. Set A^q = U^o^n- We now embed (A„, n > 0) in a continuous time process. 

On some complete (0,JF, P) let {Ux : a; G B} be i.i.d. exponential random variables 
with mean one, and define T{x) = ^y<,^ f{y)Uy. (We set T(par(0)) = 0). In terms of first 
passage percolation, if f{y)Uy is the time for liquid to percolate from par{y) to y and f{0)Uo 
is the time it takes for to get wet, then T{x) is the time it takes to percolate to x. Let 
C{t) = {x : T{x) < t} be the nodes which are wet at time t and define a{t) : C{t) ^ [0,oo) by 
a{t){x) = t — T[x). Let S = {(C, a) : C E a : C ^ [0, oo)} and let Sq denote the same set 
with So in place of 5". If A is added to M as a discrete point, define : 5 — > M U {A} for x 
in B by U^iC^a) = a{x)l{x E C) + Al{x ^ C). Give S (respectively Sq) the smallest cr-field 
J^{S) (respectively, J^{So)) generated by the maps {Ux : x eM} (respectively their restrictions 
to (So). The process Y{t) = (C(t),a(t)) is an iS-valued process. — a{T{x) f\t,x eM) and 
Qt = Qf_^ are filtrations on (O, JF) and T{x) is a (Qt) - stopping time for all a; in B because 
{T{x) <t} ^ {T{x) At <t} E g'^. Clearly Y is (^t)-adapted because n^(F(t)) takes on the 
value t — T{x) on {T{x) < t} and the value A elsewhere. 

Let cardn = inf{t : i^C{t) — n}: thus {cardn} is a sequence of a.s. finite (^t)-stopping times. 
Clearly C(cardn+i) = C(cardn) U {Dn+i} where -Dn+i £ 5C(cardn). Using the lack of memory 
property of the exponential, it is easy to see, as in Section 1 of Aldous and Shields (1988), that 
the process An = C(cardn) satisfies (2.1). We therefore may, and shall, take An = C(cardn) 
throughout this work. 

Notation. Let T{oo) — lim^^oo cardn < oo be the time to percolate to oo. It is clear that 
Aoo = C(T(oo)-) := {x : T{x) < T(oo)}. Set T{x, y) = T{y) - T{x) if x < y and T{x, y) = oo 
otherwise. Let 

T{x, oo) = lim min{T(a;, y) : y > x,y E B(n)}, 

n— >oo 

T{x-, oo) = lim min{T(par(x), y):y>x,yE B(n)} = f{x)Ux + T{x, oo). 

n— >oo 

Thus T(x, oo) is the time to percolate from x to oo, while T(x— , oo) is the time to percolate 
from par(x) to oo through x. If x G B, let [x, oo) denote {y E M : y > x} and similarly define 
{x, oo), [0, x], etc. For G C B let J-'g — cr{Ux : x E G) and define J-'n — (7{Ux ■ \x\ < n). 



6 



The next result is due to Brennan and Durrett (1986, Sec. 3), but we include a proof 
because the settings little different. 

Theorem 2.1. (a) P(T(oo) < oo) = or 1. 

(b) If P(T(oo) = oo) = 1 then Aoo = B a.s. 

(c) If P(T(oo) < oo) = 1 then there is a.s. a unique infinite line of descent in A^o, i.e., there is 
a unique sequence {spine^, n > 1} such that 

(2.2) spine^ e B(n) n^oo spine^ = par(spine„_,_i), for all n>l. 




Remark 2.2. If B is a regular d-ary tree and f{x) = g{\x\) then by comparing T(oo) with the 
time to percolate to oo along a fixed path we see that 



Rather surprisingly, if g is monotone then the converse is also true: ^g{n) = oo implies 
P(T(oo) < oo) = - see Pemantle and Peres (1994). 

Proof, (a) The event {T(oo) = oo} = na;€B(n){^(^' ^) = ^} clearly in T = nr=i ^M{n,oo)- 
The latter is a — 1 cr-field by the Kolmogorov — 1 theorem. 

(b) Clearly T(oo) = oo a.s. implies A^o = {x : T{x) < T(oo)} = B a.s. 

(c) Fix j e N. To show that for each j there is a unique choice of spine^- it is enough to show 
that 

(2.3) There exists a unique G B(j) such that Aqq fl [X^ , oo) is infinite. 

Assume (2.3) fails. Then there are distinct vertices xi,X2 in such that with positive 

probability A^q n [xi,oo) is infinite for both i = 1,2. It follows that, with positive probability. 



Therefore conditional on J-'j-i, T{xi—, oo) — T{x2—, oo) has an atom at the -measurable 
point T(par(,T2)) — T(par(xi)) G M. On the other hand T(xi — , oo) and T(x2 — , oo) are indepen- 
dent random variables with densities (because T{xi — , oo) = Ux^f{xi) +T(xi, oo) with T{xi, oo) 
independent of the exponential random variable U^i) and are jointly independent of J^j-i- 
Therefore T(a;i— ,oo) — T(a;2— ,oo) has a conditional density given Tj-i. This contradiction 
completes the proof. □ 

Similar arguments later will require the following quantitative estimate on densities of sums 
of the variables Ux- The elementary proof is omitted. 

Notation. Let Sp{a) = nr=i(l ~ ^ ^ (0) 1); P £ Z+ U {oo}, and write s{a) = Soo{<y)- 

Lemma 2.3. If {f/^, i G Z_(_} are i.i.d. exponential r.v. with mean 1, and a G (0, 1), then for 

p G U {oo}, Sp — Yl^=o Oi^Ui has a density Up{t) < Sp{a)e~^. 




oo 




P(T(oo) < oo) = 1. 



T{par{xi)) + T{xi-, oo) = T(oo) = T(par(x2)) + T{x2-, oo). 
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Notation. If x e B, then = {y : x (B y & M} is the locaUy finite rooted tree of descendants 
of X (properly translated) and Mx{n) = {y & '■ |y| = n} is the n*'* generation of the nodes in 
M^. If a; ^ and y e B^, let 

T^''\y)^f{x)-'T{par{x),x®y) 

be the rescaled percolation times for the tree Ba;. Let 

T(n) = inf{T(x) : x G B(n)} and , 
T(^)(n) = inf{T(^)(y) :yeM^{n)} 

be the percolation and rescaled percolation times, respectively, to the n^^ generation in B and 
Ba;. Finally, let 

T(^)(oo) = T(a;-,oo)/(a;)-^ 

be the rescaled time to percolate to infinity in B^. Note that if B is a regular d-ary tree 
and f{x) = al^l, then B^; = B for aU x, and {T^'^Hy) : y E M} has the same distribution as 
{Tiy) :yeM}. 

We now derive upper bounds on the and norms of the cluster size at a fixed time, 
conditioned on being finite. These bounds are crude but their proofs are fairly simple, and 
the bounds are good enough to enable us to prove that liminf > in the case when 

f{x) — Qjl^l and Q;supa.m(a;) < 1 (c.f. Theorem 2.6 and Remark 2.7). In Sections 3 and 4 we 
will have to work much harder to obtain better bounds (e.g. Theorem 4.4) which lead to the 
linear growth of h{An) for a regular d-ary tree and all a e (0, 1). 

Lemma 2.4. Let f{x) = al^l for some a e (0, 1). 

(a) P(#C(t)|T(oo) >t)< c{a) J^n^o i^Hn)a'' for all t > 0. 

(b) P(#C(t)2|T(oo) >t)< c(a)E.eB«'''(E::=o«'"#®.M)' for aU t > 0. 
Proof, (a) If a; e B, t > and 

Soo is as in Lemma 2.3, then 

F{T{x) < t < T(oo)) < P(P(t - T(x, oo) < T{x) < t\T{x, oo))) 

< s(a)P(e-^*-^(^'°°))T(a;,oo)) (Lemma 2.3) 
^'^'^^ < s(a)e-*P(exp(al^l+i5oo)«l^l+'5oo) 

< (s(a)/(l - a)) (Lemma 2.3). 

Using the fact that P(T(oo) > t) > F{Uo > t) = e"*, we conclude that 

P(#C(t)|T(oo) >t) = ^P(r(x) < t < T(oo))P(T(oo) > t)'^ 

oc 
n=0 



(b) Let e B, z = 1, 2 be both distinct from xi A X2- Let p = \xi A X2\, x[ = Xi\{p + 1), and 
write Xi = x[®yi. Then 

¥{T{xi) V T{x2) < t < T(oo)) 

< P(T(a;i A X2) + a^+^T^^^ (yi) < t < T(xi A X2) + aP+^T'^<\oo), i=l,2) 

2 

(2.5) =P(nP(T(-^)(y,) < {t-T{xiAx2))a-P-' <T^'''^\oc)\J^p)) 

i=l 

< c(tt)P(l(T(xi A X2) < t) exp{-2a-P-'^{t - T{xi A a;2)))al''il + l'^^l"^^) (by (2.4)) 

< c(a)e-*al^il+l^2|-^. 

In the last hne we again used Lemma 2.3 with Sp ~ T{xi A X2)- If either xi or X2 equals 
X = xi A X2 the above inequality is clear from (2.4). Decomposing the sum over xi,X2 in B 
according to the value of ^ = xi A a;2, we obtain 

P(#C(t)2l(T(oo) > t)) = Yl nT{xi)VT{x2)<t<T{oo)) 

00 00 

^ E E E #B.(m)#B,(n)c(a)e-*a-+"+l-l (by (2.5)) 

z€B m=On=0 

00 

< c(a)P(T(oo) > ^) ( E ( E (™)«") ') • ° 

z£M m=0 

These bounds are far from optimal. The very first inequality in the proof of (a) ignores 
the critical fact that T{x) < T{oo) only holds for a small proportion of vertices x in B(n). This 
leads to the more restrictive conditions on a in Theorem 2.6 below. 

In the remainder of the paper we will assume B is a regular d-ary tree and 
f{x) = a'^l with a G (0, 1). By Remark 2.2 wc have T{oo) < 00 a.s. The unique infinite line 
of descent in A^o, defined by (2.2), is called Spine = {spine^,r2, G N}, or the "backbone" of the 
cluster Aqq. The cluster A^q may be partitioned into Spine and a collection of disjoint finite 
clusters which branch off Spine. 

Notation. For n G Z+ and x G B(n + 1), let {s\hj{x) : j < d} denote the siblings of x, i.e., the 
points in {(a;|n) Q) i : i ^ Xn+i,i < d} in increasing order of i. We write e^j for sibj(spine„^.i) 
and for n G Z+ and 1 < j < d, set 

Ciust„,j = {a; G B : e^j ® a; G ^00} • 

Thus {Clustnj- : j < d} are the (possibly empty) clusters which branch off the backbone in 
generation n. Define a^j '■ Ciust^j — > [0, 00) by 

an,jix) = (Tioo) - T{en,j x))a-''-\ 

and let Y^j = (Clust^j-, a„^j), which is almost surely in Sq by Theorem 2.1. Let Wn = 
T(spine^, oo)q:~'^~^ denote the normalized time to percolate along the backbone from gener- 
ation n to infinity. Let W denote the sequence {Wn : n = 0, 1, 2, . . .}. For each t > 0, define a 
law ut on Sq by 

Ut{-)^F{Y{t) G ■|T(oo) >t). 
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Theorem 2.5. Conditional on (Spine, W), the collection {Y^j : n e Z^, j < d} is independent 
as n and j vary, and the joint conditional distribution of each Y^j is given by 

P(yn,i e -ISpine, W)(u;) = i^Wr,{oj) ■ 

The intuitive explanation of this is that the only information passed to the subtree beneath 
by conditioning on the backbone is that the time to infinity inside this subtree has to be greater 
than the time along the backbone. 

Proof. Choose N & N and x e B(A^). Consider an event D = C\o<n<N i<j<d-^nJ^ where -Dnj 
is of the form 

Dn,j = {Yn,j = {bn,j, a) for some a e Kj}, 

where bnj G and Fnj is the set of nonnegative functions (p on bnj such that ip{y) e F'^'^'^ for 
each y, where {F"''-''^} is a specified collection of measurable sets. This class of events generates 
criYn^j : n G j < d) and is closed under finite intersection, so it suffices to show that 

P(i?|Spine,W) = J]zyM/„(^n,i). 

If Vn = t^spine„, then clearly (7(W) = <t(K : n G N). For n < N, let Rn = T{x\n,x) + 
T(a;, oo) be the time to percolate from x\n to infinity through x. Note that spine^y = a; if and 
only if for each < n < A/" — 1, the fastest route from x\n to infinity is through a;, that is, 

(2.6) {spine^ = a;}= fj fj {i?^ < T(sib,(x|(n + 1))-, oo)} . 

0<n<Ar l<j<d 

Note also that on {spinejy = x} we have 

Clust,,,- = {y' : T^s\b,i.\n+i))^y>^ < 

and for y e Clust^j, 

an,j{v) = (Tioo) - r(sib,(a;|n + 1) 

= {T{x\n) + Rn- T{s\bj{x\n + 1) © y))^-"-^ 
= {Rn - T{x\n, s\bj{x\n + 1) © y))a~'^~^ 
= i^ntt"""^ - T(^"^^(^l"+^))(y) . 

If Bn are measurable subsets of the positive reals, then using (2.6) and the above, we have 

F{Vn e Bn for 1 <n < N, spine^v = x, Y^j G Dnj ior < n < N, 1 < j < d) 

N-l d-1 

=p(l(C/^l„ eBniovl<n<N)l[Y[ {l(i?„ < T(sib,(a;|n + 1)-, oo)) 

n=0 j=l 

X l{{y : T(2ib,(.|n+i))(^) < Rna'^-'} = bnj) 

X l(i?„a-"-i - T(s'b,(x|"+i))(y) e F"'^'^ for aU y G 6^^)}) . 
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Let g{x) = J^io,x] ^ ^[x,oo)- Observe that Rn is ^(x)-measurable and that T(s'^j (^l^+i))(y) and 
T(sibj(x|n + 1) — , oo) are ^[sibj(x|n+i),oo)" measurable. The collection of a-fields ^[sibj(x|n+i),oo) 
for j < d and < n < A^, together with ^(x), are all mutually independent. Condition the 
above integrand with respect to g{x) to see that it equals 



/ 



N-l d-1 



l{U^ln{uj) eBn,l<n<N)l[l[ {P(i?n(a;) < T(sib,(x|n + 1)-, oo)) 

n=0 j = l 

i?n(a;)a-"-i < T(s"^^ (^I^+i))(oo)) } dP(a;) . 

By (2.6) the product of the first factors in the curly braces equals P(spinejY = x\Q{x)){u!). On 
{spinejv = x} we have RnOL~^~^ = for n < N, and so the above leads to 



F{Vn e BnfoT 1 <n< N, spine^ = x, D) 

= J l(spinejv = x, ?7spine„ & ioi 1 < n < N) 



N-l d-1 



n=0 j = l 

|j.(sib,(x|n+i))^^) > Wn{u;)) dF{u;) 

N-l d-1 

1(K e for 1 < n < iV, spine^ = ^) 11 11 ^^^i^) ■ 

n=0 j = l 

We have used the equivalence in law of {T^^^ (y) : y e B} and {T{y) : j/ e B} in the last line. □ 

The above decomposition and the and bounds in Lemma 2.4 allow us to use the 
law of large numbers to establish linear growth of h{An) for sufficiently small a. The proof 
illustrates the basic approach we will take in Section 6 to obtain the result for all a < 1. 

Theorem 2.6. Assume a E {0,d~^). Then 



liminfn h{An) > C2.i(q;, (i) > a.s. 

n—>oo 

Proof. If cr(n) = inf{A; : h{Ak) = n}, the result is equivalent to 
(2.7) lim sup n~^a"(n) < c < cxd a.s. 

n— >oo 

Decompose C{T{n)) into the backbone vertices in C(T(n)) (there are at most n + 1) and the 
portions of the clusters {ek,i © Clustfc^j, k < n, i < d} which are contained in C{T{n)). This 
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shows that 



n n n ^-^ * ^-^ 



A;=0 \ i=\ 



Conditional on (W, Spine), {#ClustA;,i : /c > 0, i < d} are independent random variables 
(Theorem 2.5) such that 



P(#Clust^,,|Spine, W)(a;) = iv^„M((#C)^) 

< c{a) ^ al^l ( J2 or(rf (Lemma 2.4) 

< c(a)(l - ad)-^ 



Therefore UkAi'^) = P(#ClustA;,i |Spine, W)(a;) is also uniformly bounded by c{a,d) say, and the 
strong law of large numbers (applied conditionally) implies that 



d-i 



limsupn ^ ^ ^(#Clustfc,j - iJik,i) = a.s. 

fe=0 i=l 

Use this in (2.8) to see that limsup^^o^ a{n)n~^ < 1 + dc{a,d) a.s., thus proving (2.7). □ 

Remark 2.7. In the more general setting of Theorem 2.1 (general B and /), it is just as easy to 
decompose A^o into Spine (the backbone) and clusters {Clustnj : j < m(spine„), n e Z_|_} which 
branch off the backbone in generation n. With only notational changes in the proof, it is then 
possible to derive an analogue of Theorem 2.5. The lack of scaling means the conditional law of 
will also depend on the tree Bg^ ^ and the appropriately shifted and rescaled version of 
/. One can then show that Theorem 2.6 remains valid if B is a rooted tree with no leaves such 
that m(x) < d for all x and f(x) = al^' for some a e (0, d~^). The proof is the same. 

Notation. The wide sense past up to x is defined by Ex = J-(^x,ooY ^ind we let 8^- = ^[a;,oo)=- 

The following strong Markov property will be used in Section 6. It states that if you stop at 
a stopping time when par{x) has been reached, but x has not been reached, then the remaining 
times to hit vertices from the subtree rooted at x, rescaled, are equal in law to the original 
system of hitting times. As might be expected, the main difficulty in obtaining this strong 
Markov property is in getting the statement right. 

Theorem 2.8. If a is an a.s.-finite (^t)-stopping time and a; e B, then for each measurable 

Be [0,oo)», 

(2.9) p(((T(a; © y) - a)a-\^\ :yeM)eB\g,V = P((T(y) : y e B) e S) 

almost surely on {x G dC{a)}. 

Proof. Note that {x G dC{a)} = {T(par(x)) <a < T{x)} is in g^. (Recall that T(par(0)) = 0). 
Assume first that a > is constant and consider (2.9) with in place of Qf Let u be the 
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exponential law with mean 1 and let denote product measure on [0, oo)^. Define Uq = 
{T{x) - C7)a-I^l and Uy = U^^^y for y e B \ {0}. We claim that 

(2.10) ¥{{Uy : y e B) e -1^?° V = a.s. on the event {x G dC{a)} . 

Indeed, g° V 8^,^ = a(T(y) A a : y > x) V So:- V (t{1{T{x) > a)), and T{y) A (t = for all 
y > X on {x e dC{a)}; the latter event is measurable with respect to £x- V a{l{T{x) > cr)), 
and so (2.10) is equivalent to 

(2.11) F{{Uy :y eM) e -{S^c- V a{l{T{x) > a))) = a.s. on the event {x G dC{a)} . 

Let Fi and F2 be finite subsets of B \ {0} and [x, ooy respectively, let Bq be a measurable set 
of nonnegative reals, and let Bj be Borel subsets of [0, oo)"'^-' for j = 1,2. If Gi = {{Ux^y '■ y G 
Fi) G Bi} and G2 = {{Uy : y G F2) G S2}, then 

F{Uo gSo,Gi,G2,x G dC{a)) 

= P(l((T(x) - a)a-l^l G Bo, G2, T(par(x)) <a< T{x))F{Gi\Sx)) 
= iy^HSi)p(l(G2,T(par(a;)) < (7) 

X F{Ux -{a- r(par(a;)))«-l^l G 5o, C/^ > (a - T(par(a:)))a-I^l |f^_)) . 

Since T(par(x)) G £x- and C/a; is independent of Sx-, the lack of memory property of the 
exponential shows that the conditional expectation term in the last line above is equal to 

u{Bo) exp[-a-l^l {a - T(par(x)))] = u{Bo)nUx > a"!"! {a - T{par{x)))\Sx-) 

on the event {T(par(a;)) < cr}. Substitute this in the previous equation to conclude that 

P(f/o G So, (Uy-.ye Fi) G BuG2,x G dC{a)) = v{Bo)u^^{Bi)F{G2,x & dC{a)) . 

It is easy to see this implies (2.11) and hence (2.10). Noting that 

(T{x®y)-a)a-\-\= J] U,a-\'\ , 

z&M,z<y 

one derives (2.9) for a constant and Q° in place of Qcr- The entire argument generalizes easily 
to the case where cr is a (^°)-stopping time taking countably many values. 

For a general (^t)-stopping time cr, choose (^°)-stopping times cr„ > cr so that cr„ is a 
multiple of with cr^ | cr. Note that x G dC{a) implies that x G dC{an) for sufficiently 
large n. Taking limits in the above result, we arrive at (2.9) with flnl^a^ ^ ^x-) in place of the 
smaller cr-field Qa^ M E^-- The result follows. □ 

Remark 2.9. This result justifies our earlier assertion that the process = C(cardn) is a 
Markov chain satisfying (2.1). (Recall that card^ is the time C(t) reaches size n.) 
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3. Some Stochastic Monotonicity Lemmas 



The stochastic monotonicity results derived in this section wiU play a pivotal role in the 
proof of the key and estimates in Section 4. 

There is an obvious isomorphism between and R{°^ x (r1S(0'^-i))^(i) which we 

denote u t-^ u and we extend this isomorphism to functions (p : M'^^'^'"^) — M by defining 
= f{u). We will use the same notation {u u) to denote the isomorphism between 
^B(n) 11^61(1)^^''^''"^^ and hence define ip{u) = ip{u) for ip : M®(") M. The purpose 
of this notation is to allow test functions ip to be built recursively, yielding inductive proofs of 
distributional inequalities. 

If (5, B, <) is a partially ordered ordered measurable space and v are probability laws on 

{B,B), fi is stochastically smaller than v (write [x < v) \i and only if J '\\)d\i < J ipdu for every 
bounded measurable non-decreasing •0 : S — » M. If the law of X is stochastically smaller than 
that of y , write X -<Y. If B = for A C M and T a countable set, we always partially order 
5 by / < ^ if and only if f{t) < g{t) for aU t e T. Partially order S by (C, a) < (C, a') if and 
only if C C C" and a < a' on C. 

If r is a finite subset of B, t e M, and n e Z_|_, define a probability ^{T, n, t) on (— oo, 0)'" 

by 

u{r, n, t){-) = F{{t - T{x) : a; e F) e • |T(n) > t) 
(for t < the conditioning is trivial). 

Lemma 3.1. 2/(13(0, n), n, s) ~< z/(]B(0, n), n, t) whenever — oo < s < t < oo and n e Z_|_. 

Proof. If n = 0, z/(]B(0), 0, s) = z/(]B(0), 0, 0) for s > by the lack of memory property of the 
exponential distribution and is trivially stochastically non-decreasing for s < 0. 

Assume for induction that the result holds for n — 1. Fix t e M and for z e B(l), let 
(Pz be bounded measurable functions on ]R"(0'"'~i). Let (po be bounded and measurable on M 
and define (p on ]R'^(°'"') by (p(u) = (po{uo) n«eB(i) V^zi^i^))- ^^^'^ (p^°^\u) = (p(au), and set 
g{u) = P(T(n - 1) > uy. Use the independence of the vectors = {T^^\x) : x e B(0, n - 1)) 
and integrate out the exponential variable Uq to see that 

P(T(n) > t)u(M{0,n),n,t){(p) 

nOO 

= / e-'<po{t-s) 
Jo 

xF{ Yl l{s + aT^^\n-l) >t)ip^{t- s-aT^^\x) :x eM{0,n-l)))ds 

pOO 

= e-^g{{t-s)a-')ip[''\{t-s)/a) 
Jo 

X Yl H'pi''\{t-s)a-^ -T^'\x):xeB(0,n-l))\ T^'\n - 1) > '^-^) ds 
zeB(i) ^ 

poo d, 

= / e-'g{{t - s)a-') [ (g) z/(B(0, n-l),n-l,{t- s)a-^)] (^(«)((t - s)a-\ •)) ds . 
Jo ,=1 
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The restriction on Lp is then easily removed, to yield the above equality for all bounded measur- 
able on If 

d 

V'(^) = [ (8) i^(B(0, n - 1), n - 1, v)] ((^(«) {v, ■)) , 
a change of variables in the above integral leads to 

zy(B(0,n),n,t)((/p) = (e*P(T(n) > / e'^'' g{v)tl;{v) dv . 

J —oo 

Take Lp to be the constant function 1, hence '0 = 1 also, to see that 

/t/a 
e'^''g{v)dv, 
-oo 

and therefore conclude 

/t/a C^l 
e'^'' g{v)i){v) dv ( / e"^t/('y) dv)~^ . 
-oo J —oo 

If 99 is non-decreasing, then ip is nondecreasing by the induction hypothesis. Since (3.1) expresses 
u(E{0, n), n, t){ip) as a weighted average of values of '4> with respect to a weighting measure that 
stochastically increases in t, it follows that z>'(B(0, n), n, t)(</3) is nondecreasing in t and the 
induction is complete. □ 

Corollary 3.2. Vs ~< i^t whenever < s <t. 

Proof. Define T : [0, 00) by T{t){x) = t - T{x) and ^/^ : ^ <S by 

^(7) = (C, a), C = {x: ^{x) > 0}, a{x) = ^{x) ioi x e C : 

here 7 G M'^. Then •0 is non- decreasing and Y{t) — '0(r(t)). It therefore suffices to show 
that t P(r(t) e • \T{oo) > t) is stochastically non-decreasing, and for this it suffices to fix 
m e N and show that if r^(t) = ^{t)\M{o,m)^ then t P(r^(t) e • |T(oo) > t) is stochastically 
non-decreasing (see Kamae, Krengel and O'Brien (1977, Proposition 2)). By taking limits, one 
reduces this in turn to proving 

(3.2) t — > P(r^(t) G ■ \T{n) > t) is stochastically non-decreasing for all n> m. 

It suffices to consider (3.2) with m = n since decreasing m only weakens the conclusion. But 
(3.2) with m = n is precisely the conclusion of Lemma 3.1. □ 

The inductive arguments from here on require a second set of percolation times, defined 
analogously to the first but not including the percolation time at the root of each subtree. We 
apologize for doubling the notation but promise not to do it again. 
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Notation. Let 

Tnot(n) = inf{T(0, x) : x e B(n)} = T{n) - Uq 

T^rSotH = inf{T^^ot(2/) ■■ V e B(n)} = T^-\n) - . 
In short, the T^^^'s include a contribution from while the ^^q^-'s do not; times with super- 
scripts are rescaled. It is evident that {T|^q^^(?/) : ^/ G B} is equal in law to {T^^^^d/) : y eM}. For 
\z\ > 1, let Firstn(2;) denote the a.s. unique vertex in B(n) such that T^^\n) = T*^^) ( Firsts (-z)), 
so that z ® Firstn(^) is the first descendant of z in generation |2;| + n to be reached. (Of course, 
this is not necessarily the one through which infinity is reached from z). Let Firsts be the a.s. 
unique vertex in B(?i) such that T(n) = T(Firstn). For z G B(n) let F^i-) = P(- |First„ = z) and 
{lJ,{z, t) : t > 0} be a set of regular conditional probabilities on (—00, 0]^^'^^ for 

P.((7^notH - TiO.x) : x G B(n)) G • iTnotH = t). 

With the available symmetry, we could have defined t) instead of fi{z,t), but in this 
case keeping greater generality also reduces confusion of types. 

Lemma 3.3. For any 2; G B \ {0}, there is a version of the set {fJ,{z, t) : t > 0} such that 

IJ,{z, s) -< n{z, t) for < s < t. 

Question: Is there a useful description of the increasing limit, iJ.{z, 00) of the measures ^{z,t)7 
How about the measure ^(2, 0)? In either case, sending \z\ to infinity and rescaling by Q!~I^I 
should then yield a locally finite point process on (—00, 0]. 

Proof. We start by establishing a pair of auxiliary results, (3.3) and (3.4), whose proofs could 
be omitted on a first reading. Let z = {zi, 2:2, ... , Zn) — zi (B z' E B(n), n > 1. The first result 
is that, conditional on Firsts = z and on the percolation times from the vertex zi, the vectors 
{T^^\y) : y eM} are i.i.d. as x ranges over the other first generation vertices, and distributed 
as {T{y) : y E M} conditioned on T(n — 1) > TQQi{n){u)a~^. More precisely, letting T denote 
the vector {T{x) : a; G B), we show that 

d-l 

(3.3) P,((T(-) ■.\x\ = 1,x^ z,) G •|.F[,„oo))M = (g)P(T G •|T(n - 1) > Tnot(n)(a;)a-^) , 

i.e., the RHS of (3.3) defines a regular conditional probability for the left side. 
To prove (3.3), note that 

{FWstn = z} = {T^'^\n - 1) = a-'^T{0, z)} n f] {T^''\n - 1) > a-'^T{0, z)} 

\x\=l,x^z 

almost surely. This shows that if (^j, : R'^ ^ R are bounded and measurable for x G B(l), then 
P(l(First„ = 2) n MT^''^))=^{^z^{T^''^)l{T^''\n-l)=a-'T{0,z)) 

\x\ = l 

X n nT^"\n - 1) > a-'T{0, z))^x{T^^^)) • 
xeM{i)\{zi} 
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The term in front of the product is ^[2^^oo)-measurable (since ^[z^^oo) is just a{T^^^^)) and 
conditional on ^[2^,00)) the vectors T^^^ for x ^ z\ are i.i.d. copies of T. Thus 

P,( n ^,(T(-)))P(First„ = ^) 

a;eB(l) 

xeB(i)\{2i} 

X [l(r(^i)(n-l)(a;) = a-^T(0,z)(a;)) JJ P(T(^)(n - 1) > a-^T(0, ^)(a;))] dP(a;) . 

xeM{i)\{zi} 

The term in square brackets equals P(First„ = z\T[zj^^oo)){(^) ^^d so we may conclude that 




From this, (3.3) follows immediately upon noting that T(0, z) = Tnot(^) ^--S- with respect to F^. 
The second result is that if the joint distribution of the times T^^^^{-) is conditioned on 

■^not Firsts = z, then the value of Uz-^ is irrelevant, and the informa- 

tion Firsts = z may be replaced by the weaker First^_i(2i) = z'. Formally, if P^/(-) denotes 
P(-|First„_i(2;i) = z') and : — > R is bounded and measurable, then, letting T^qi denote 
the vector (T^^^{x) : a; e B), we claim 

(3.4) lPz{^{Ti^:^Moti^ - 1)' U.,) = K'i^iT^^oMoti^ - 1)) a-s- 

Note that P^ is absolutely continuous with respect to P^,, so that in (3.4) we are asserting that 
the right side defines a version of the left side. 

To prove (3.4), first drop the conditioning on t/^^ from the LHS by observing t/^^ to be 

independent of a{Tlf^^{-)). Next, write 

{Firstn = z}^ {Firstn_i(^i) = z'} n f| {U^, + T^'^i{n - 1) < T^^\n-1)}. 

xeB{i)\{z,} 

The independence of T^o\.^ and {U^^.T^''^ : x e B{1) \ {zi}) shows that 

P(First, = z|T^^\.)) = P(First„ = 2;|l(First„_i(zi) = ^O^^^not - 1)) a.s. 
If '0 is a bounded, measurable real function, then 

P(First„ = z)F,i^{T^^'^i)i;{Tlf^iin - 1))) 

=P(v'(rnot )^(^not - l)mFirst. = z|l(First„_i(zi) = z'), TS(n - 1))) 
-P(F(¥'(TS)|l(Firstn_i(zi) = z'),T^^^i{n - l))V'(TS(n - l))l(Firstn = z)) . 
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A separate consequence of the definition of conditional expectation is that almost surely on 
{First^_i(2;i) = z'} (which contains {Firsts = z}) 

n^(rS)|l(First.-i(^i) = ^O,r(^ot - 1)) = K'i'PitoMoii^ - 1)) • 
Combining this with the previous identity shows that 

P,(^(T(-))V.(T(-)(n-l))) 

=F{rAv{T^'^Mot(^ - miT^otin - l))l(First„ = ^))P(First„ = z)-' 
=P.(n'(<^(7^rSot - miT^liin - 1))) 

and (3.4) follows. 

The lemma is now proved by induction on n as follows. For n = 1, let Qz = Sq and for 
X e IB(1) \ {z}, let be the law of —a times an exponential of mean 1. The lack of memory 
property for the exponential law shows that (S)a;eB(i) Qx is a version of ij,{z, t) for all t > 0, so 
the result holds with equality of all the laws. 

Assume the result now for \z\ < n, retaining the above notation. Let (px '• 'R^^'^~^^ — > M 
be bounded and measurable, with (p : R'^*^") — M given by ip{u) = Y[xm{i) ^xiui^)), i.e., by 
(f{u{-)) = Y[xeM{i) ^xiu{x © y) : y e B(n — 1)). Note that if a; e B(n — 1) then almost surely 
with respect to P^, 

Tnot(^) = «(t/.,+T(^ot(n-l)) 
(3.5) Tnot(n) - aT'^'^^x) = a{Uz, + T^^'^l{n - 1)) - a(t/,, + T^^^t (x)) 

= «(^not(^-l)-rnot(^))- 
Condition on ^[2^,00) ^^nd use (3.3) and (3.5) to see that 

P,(^(Tnot(n) - T{Q,x) : x G B(n))|T^^o\)(n - 1), U,,){u) 

= P4 n ¥'y(Tnot(n)-aT(j')(a;):xeB(n-l))|T^^ot(n-l),t/,J(a;) 
j/eB(i) 

= \[ W{^y{a{T^ot{n){uj)a-^ - T{x)) : a; e B(n - l))\T{n - 1) > Tnot(n)(a;)«-i) 

j/GB(l)\{«i} 

X P.(<^.,(a(TS(n - 1) - Tl^'^lix)) : a; e B(n - l))\Tl^'^l{n - 1), U,,){u) . 
Recall the notation ^p^'^\u) = ^p{au). The previous line now becomes 

^MTr^ot{n) - T(0,a;) : x e B(n))|T(^J^)(n - 1), U,,){u) 

= { n [KB(n-l),n-l,Tnot(n)(a;)a-i)(vpi"))]} 

yGB(l)\{2i} 

X P.(v^irnrS(n - 1) - T^^ot (x) : X G B(n - l))|TS(n - 1), U^M^) . 
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Now use (3.4) to equate the above to 

{ n H^^"" - 1), n - 1, Tnot(n)(a;)a-i)((^("))] } 

yen{i)\{zi} 

X (v^S^T^^ot (n - 1) - T^ot (x) : X e B(n - l))|T(^ot (n - l))(a;) . 

Use the identity F'^,{{T^^^^{x) :xeM)e-)= P^/((T(0,a;) : a; G B) e •) (in words: looking in the 
subtree from zi conditioned on zi(Bz' being the first of its generation reached among the subtree 
is the same as looking in the whole tree conditioned on z' being the first in its generation) to 
get 

P.(^(^not(n) - T(0, x) : x e M{n))\T^'^^ {n - l),U,,){iv) 

j/€B(l)\{2i} 

( Z ^ 

Condition both sides of the above with respect to cr(Tnot('^)) which is contained in a{Tj:^Q^, Uz^)\/ 

{Pz-nuU sets} and use (3.5) together with the independence of Uz^ and T^Q^{n — 1) to conclude 
that 

Fz{<p{Tnot{n) - T{0,x) : x e B(n))|Tnot(n)) (a;) 

/■^not(")('^)/" r 

= / [l^{z',s)x (g) zy(B(n-l),n-l,Tnot(n)(a;)a-') 

X (7^) exp(-((Tnot(n)(a;)/a) - s))F,{T^'^^{n - 1) G ds) 
'•T'not(")('^)/« 



X ( / exp(-((Tnot(n)(a;)/a) - s))¥z{T^'^^{n - 1) e ds))'' 

Jo 

/ [f^{z',s)x (g) iy(B(n-l),n-l,Tnot(n)(a;)«-i)J 

/''^not('^)('^)/'-' 

X (^(«))e^P,(T^^^\)(n -l)eds){ e^P.(7^S(" " 1) ^ ^^^))"'- 

^0 



Let ji{z,t){i~p) be defined by the above expression with t in place of rnot("')(<^)- The above 
shows that {jJi^z^ t) : t > 0} are regular conditional probabilities for the required conditional 
distributions. By induction, there is a version of the measures iJi{z' , s) that is stochastically non- 
decreasing in s, while Lemma 3.1 shows that (S'y6B(i)\{2i} '^(18(n— 1), n— 1, ta~^) is stochastically 
nondecreasing in t. Thus for each bounded, nondecreasing function, (p on M'^'^"^), ii[z, t){(p) may 
be written as a weighted average on [0, t/a] of a nondecreasing function: 

t/a 

^|;{s,tMdsh{[0,t/a])-\ 

where i/j is nondecreasing in each variable (combining the effects of n{z' , s) and the measures 
z^(B(n — 1), n — l,t/a)), and 7 is the locally finite measure given by the term times the density 
of T^Q^{n — 1). Such a weighted average is clearly nondecreasing in t. □ 
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Corollary 3.4. If ^ > and n e N, there is a nondecreasing version of 



/C„(t)=P( ^ exp(-^(T(0,a;)-Tnot(n)))|Tnot(n) = t). 

Proof. For ^ e B(n), let 

/Cn,.(t)=P( Yl exp(-^(T(0,x)-Tnot(n)))|First„ = 2;,Tnot(n) 

a;eB(n) 

Since the sum is increasing in each Tnot(^) -~ T{0, x), Lemma 3.3 implies that each }Cn,z{t) has 
a nondecreasing version. But by symmetry, ]Cn,z is independent of z, so is equal to /C^ almost 
surely. □ 

The following elementary result is proved by an integration by parts and is stated for future 
reference. 

Lemma 3.5. Assume p, g : M ^ [0, oo) with p non-decreasing and q non-increasing. Then for 
any random variable X, ¥{p{X)q{X)) < P(p(X)) P(?(X)). □ 



4. Bounds for the Clusters 

As we saw in the proof of Theorem 2.6, the key in establishing the linear growth of h{An) 
is a good bound on the size of each cluster off the backbone. These clusters are governed by 
the laws {I'ti't > 0) of C{t) conditioned to still be finite at time t (Theorem 2.5). The main 
results of this section are Theorems 4.4 and 4.6, which give uniform bounds on vt{ij^{C)) and 
vtmCf) for t > 0. 

Notation. Set H{t) = P(T(oo) > t) and G{t) = e^H{t). 
Lemma 4.1. (a) The functions G and H satisfy 

nt/a 

(4.1) H{t) = e-*(l + a / e'^''H{uYdu), t > 0, 

Jo 

(4.2) G{t) = l + a [ e-''^'^-°''^G{uYdu, t > 0. 

Jo 

(b) G{t) increases to a finite limit 04.1(0;, d) as 1 1 00 and 

1 < C4.i(Q;,ci) < exp (log((i/(d — Q;))/(l — a)). 

Proof. Condition on Uq and use scaling to conclude that 

H{t) = e-^+ I e-'H{{t- s)/afds. 
Jo 
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Setting u = (t — s)/a gives (4.1), and (4.2) is then immediate. Equation (4.2) shows that 
G{t) increases to a (possibly infinite) hmit C4,i{a,d) > 1 as t ^ oo. Lemma 2.3 shows that 
H{t) < s{a)e~^ and therefore H{t) < e~*^*~*°) where to = log(s(Q;)). Hence from (4.1), 



H{t) < e-* + ae-' [ " e'^'^du + l{t/a > to)ae-' [ ' e'^'' e''^^'^'^^^ d% 

Jo J to 



iu 

'to 

< e'^*"-* + a{d - a)-ie-*+'^*o-('^-")*o = e-^*-*'\ 



where ti = h{to) := ato + log{d/ {d — a)). Iterating this procedure, we obtain H{t) < e ^* 

for n > 1, where t^+i = h{tn) and hm„_»<x) tn = \og{d/{d - a)){\ - a)-'^. Let n — > oo m 

G{t) < e*" to complete the proof. □ 

Remark. (4.1) and (4.2) are rather nasty equations since they are non-linear and (worst of 
all) anticipative. (See Athreya (1985) for some similar equations, arising from the distribution 
function of the random variable sup^^bT(x).) Solutions to these equations are not unique 
because H = 1 also satisfies (4.1). It is, however, not hard to show that P(T(oo) > t) is the 
unique non- increasing solution H to (4.1) for which e'H{t) is bounded. Although it seems 
difficult to get sharp estimates from these equations, in the next section some closely related 
equations will help us analyze the process {Wn, n G Z_|_), and in Section 8 we will derive some 
asymptotic results as a j 1. In particular, the upper bound on C4.1 in (6) is by no means optimal 
- see Remark 8.3. 

Notation. If n is a non-negative integer, define a Laplace transform with respect to the real 
variables 6 e [0, 1] and 7 > 1 by 

r(n,^,7) = P( Yl exp(^Tnot(n)-7«"""'(T^St(^)-^not(n)))),neZ+. 

a;6B(n) 

Let 

do = ld{d-l) - 1. 

For 7 > a, n e N and < 5 < t, let 

S{n,^,5,t)= J2 <T{x) <t)exp{-^a-''-\t-T{n))). 

To get a feeling for S, set 6 — t, and integrate over t to get 

/•oo 

/ S{n,^,t,t)dt={a^+'/^) J2 exp(-7a— HT(x)-r(n))). 

x€B{n) 

This is 0(0;'^+^), provided that not too many times T{x) — T{n) are near on a scale of ct". 
The next result gives the critical technical estimate on r, which is then used to show that 5" is 
indeed 0{a^) when 7 > a + 1. This will in turn lead to Theorems 4.4 and 4.6 on the respective 
and behaviours of the cluster size. Recall that s{a) — 11^^1(1 ~ a^)~^. 
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Lemma 4.2. For all n E Z+, 7 > 1, 

r(n, 1,7) < C4.2(q!, 7,d) := (f{d - 1)~^(1 - a)s(a)'^ exp(do/ ('y(l - a))). 

Proof. The result is trivial if n = 0, thus assume n > 1 and fix 7 > 1. For the induction, assume 
the result for all n' < n and all 9' (what is actually needed is 9' = aW). Fix zi G {1, . . . ,d} and 
set r = min{T*^^^(n — 1) : y € \ {^i}}- Using the symmetry of the tree when computing 
r(r7., 0, 7), we may sum over only those x with xi — zi and then multiply by d. This leads to 

r(n, ^, 7) = • P( Yl (^^not(n) - 7«"""'(«t^.i + «T^ot (^) - aT(^i)(n - 1)) 

a;GB(n-l) 

- 7«-"-'(«T(^^Hn - 1) - TnotH))) 

= d • P( ^ exp (^Tnot(n) - 7«-"(rS(^) - " 1)) 

a;GB(n-l) 

- 7«-"(T(^^Hn - 1) - a-'Tnotin)))) • 

Divide the above expectation into two terms corresponding to the events {T^^^\n — 1) > r} 
and {T(^i)(n - 1) < r}. Use the fact that on the latter event T^f^iin - 1) = a-^Tnot{n) while 
on the former event r = a~^TQoi{n) to see that 

r(n, ^, 7) = /i + where 



d-F\l{T^^^\n-l) > T)exp(6'Q;T-7a-^(T(^i)(n- 1) -r)) 
X Y exp(-7a--(TS(x) - T^^J^^ln - 1)))), 

x€M(n-l) 

d ■ F(l(T^^^\n - 1) < r) exp(^ar(^i)(n - 1)) 

X eM-lo^-'iTi.liix) - t'^lin - 1)))) . 

a;eB(n-l) 

The plan is to bound these terms by constant multiples of r{n — 1,Q!6',7) and then apply the 
induction hypothesis. (In fact Ii will be of smaller order). 

Consider h first. The term l(T(^i)(n - 1) > r) exp(^ar - 7a-"(T(^i)(n - 1) - r)) wiU 
make this term relatively small as n becomes large; heuristically, the typical difference between 
T(n) and T{x) will be of order 1 when xi 7^ ^i, and so the factor of in the exponent makes 
these terms small. To verify this, fix Z2 G -B(l) \ {^1} and set R — T^^^in — 1) — T'^^^{n — 1). 
Focus on the expression in front of the sum in Ii and integrate over the pair (C/^^, C/^j) 



(4.3) 
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that 

P(l(T(^^)(n - 1) > r) exp(^«T - ^a-^{T(^^\n - 1) - r))|^(,,,oo)) 
= {d- 1)P(1(C/., + T^^ot (n - 1) > r = C/,, + T^^^^^^n - 1)) 

X eMMUz, + T^ot (n - 1)) - 7«""(t^.i - U,, + R))\J^^,^,^)) 

< (d - 1)P( [ / / l(w2 - wi < i?) exp(6'aw2 - 'ya~''{ui - W2))e""^ dui 6""^ du2] 
^ Jo Jo 

X exp(^aT^J^^(n - 1) - 7«-"i?)|:^(.„oo)) • 
First integrate U2 over (0,tti + R) and then integrate iti over ((— i?)+,oo) to bound the above 

by 

{d - l){ad + 7^-^ - l)-^(2 - ae)-^ 

X P(exp(^aT^"Q=^^)(n - 1) - (2 - + (aO - l)i?)| 

= (d - l)a"(7 + - a")-\2 - aO)'^ exp(6'aT^^\.^ (n - 1)) 

X P(exp(-(2 - ae){-R)+ - R)\J'i,„oo)) ■ 

Note that -(2 - ae){-R)+ - R<0 for a,d E [0, 1], and so this is bounded by 

{d - 1)q!"(7 - q;")-^(2 - aO)-^ exp{eaT^'^^{n - 1)) . 

Substitute this bound into the expression for Ii (the summation being jF(2^ oo) -measurable) to 
conclude that Ii is at most 

d{d - l)(2-a^)-ia^(7 - a^)"^ 

(4.4) E ^MO<^T^'^i{n-l)-ia--{Ti,:^^{x)-Tl^:^^{n-l))^ 

xeB(n-l) 

= {do + 1)(1 - a6'/2)-^a^(7 - a")-V(n - 1, aO, 7) . 

Consider now 12- By symmetry, as 9 becomes small the terms in front of the summation in 
the expression for I2 should have mean close to 1, and so I2 should be close to r(n— 1, aO, 7). This 
is true, but to make the argument rigorous we must control the possible correlations between 
the summation and the remaining terms in the integrand. This makes use of the monotonicity 
results from Section 3, and in particular Corollary 3.4. 

Let q{t) — F{exp{9aUzi)l{Uz^ + t < r)) and note that q is decreasing in t. Use the 
independence of J-(zi,oo) ctnd D cr(^2i, t) to see that 

P(l(T(^^)(n - 1) < r) exp(^aT(-^)(n - l))|.F(,„oo)) 

(4.5) = P(1(C/,^ + T(^J,)(n - 1) < r) exp(^a(C/,, + T^'^^in - iMT^^^iin - 1)) 
= exp(^aT(^J,)(n - l))g(TS(n - 1)) . 
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The joint independence of r, t/^^ and T^Q^^-'(n — 1) shows that 

q{u) = F{U,, + T^'^l{n - 1) < t\U,, = u) 
is decreasing in u. The same independence and Lemma 3.5 give 

.4 n<li4oti^ - 1))) = neMOaU^MU.,)) 

< P(exp(^at/,J)P(Q(t/,J) = (1 - 9a)-^d-^ 

by symmetry. Using (4.5) in the expression for I2 we get 

h = ciP(g(T^^ot (n - 1)) eMOaT^'^iin - 1)) 

X P( eM-lo^-'^i&x) - T^^ot (n - l)))|T(-)(n - 1))) . 

x€M(n-l) 

Corollary 3.4 shows we may assume the conditional expectation is a nondecreasing function of 
-^not ~ ■'■)• R-ecalling that q is nonincreasing, we again use Lemma 3.5 to conclude that 

h < rfP(g(TS(n - mHeMOaT^^l'^in - 1)) 

X Yl exp(-7«-(T^^„\\a:)-T^^J,)(n-l)))) 

x€n{n-l) 

by (4.6). 

Combine the above with (4.3) and (4.4) to see that 

r(n, e, 7) < (1 - aO)-^ (l + {do + l)a'^(7 - a'')"^) r(n - 1, aO, 7) 
= (1 - a6')-^ (1 - a"7-^) "^(1 + do^~^a'')r{n - 1, aO, 7). 

We now use induction, recalling that 7 > 1, to conclude that 

n— 1 n n 

r{n, 1, 7) < [ n (1 - n (1 - «'7"')"' n (1 + ^07-'«')] r(l, 7) 

^ fc=l fc=2 fe=2 

< s{af{l - a) exp(do7~^(l - a)~^)r(l, 7) . 

Note that 

r(l,a"-\7) < dP(exp(a'^-^Tnot(l))) < rfP(exp( min Ua:)) = d^/(d-l), 

xeM(i) 

and use this in (4.7) to complete the argument. □ 
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Lemma 4.3. For 7 > 1 + a, n e Z+, <6 <t, 

¥{S{n,^,6,t)) < e-*C4.2(a,7-a,d)(a"+^ A5). 

Proof. Let 7,5, t be as above, and let n e Z_|_; for x G B(n) we evaluate the corresponding 
summand in S{n, 7, S, t). First, integrate out Uq to see that 

F(l(^ -S< T{x) < t) exp(-7a-"-^(t - T(n)))) 

= ¥{l{T^%{x) < t) / "° exp(-« - 7(t - « - T(°\(a;))a-"-i) d« 

X exp(-7(T(°\(a;) -Tnot(n))a— ^)) . 

Change variables to s = t — u — T^^^{x) and bound the Lebesgue integral from above on the 
event {T^°\(a:) < t} by: 



I 



<5A(t-T(°\(x)) 

exp(-(7a— 1 - l)s) dsexp(-(t - T^°\(a;))) 



<a"+i(7 - - exp(-(7a— 1 - 1)5)) exp(-t + t(°\(x)) 

<e-^Q;"+\7 - a"+^)-^min{l, (70;-"-^ - 1)5} exp(T^^t(^)) 
<e-* min{a"+i(7 - a)-\ 6} exp(T(°\(a;)) . 

Thus 

P(l(t - 5 < T(a;) < t) exp(-7Q;-"-^(t - T(n)))) 

< e-* min{a-+i(7 - a)-\ 5}P(exp(Tnot(n) - (7«-"-' - miSti^) - TnotW))) 

< e-*min{a"+i(7 " a)-\ S}F{cMTnot{n) - (7 - a)a^"-^(T(J,\(a;) - TnotH))) • 
Summing over x in M{n) and recalling that 7 > 1 + a, 

P(5(n,7,5,t)) < e"*min(a"+^(7-a)-\5)r(n, 1,7-a) 

< e~^C4.2(Q;,7 - Q!,(i)(Q;"+^ A 5) (by Lemma 4.2). □ 

Notation. Let 04.3(0;, d) = 04.1(0;, (i)''c4.2(o;, d — a,d){l — a)~^. 
In the following result we set C(0— ) = 0. 

Theorem 4.4. (a) For all < 5 < t and n in Z_|_ 

P(#((C(t) - C((t - 5)-)) n B(n))|T(oo) > t) < 04.1(0;, d) V2(ci;, - o;, (i)(a^+^ A 5). 

(b) For all < 5 < t, 

P(#(C(t)-C((t-5)-))|T(oo)>t) 

< C4.i(a,rf)^C4.2(a,ci-a,rf)(l -a)"^(log+(l/5) + 1)(5A 1) 

< C4.3(a,<i), 

and so in particular, P(#C(t) |T(oo) > t) < 04.3(0;, d) for all t>0. 

(c) P(#(C(t A T(oo)) n B(n))) < C4.3(a, d){l - e"*) for alH > 0, n G Z+. 
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Proof. Clearly T(oo) < T(n) + T(Firstn,oo) and ii z e B(n) then Lemma 4.1 shows 

(4.8) P(T(^,oo) > =P(T(oo) > ta-''-^y < C4.i(a, d)^exp(-(ito-"-^). 

Therefore, 

HmC{t) - cat - 6)-)) n B(n))l(T(oo) > t)) 

= J2 At-S< T{x) < t < T(oo)) 

x6B(n) 

< ^mt-S <T{x) <t)F{T{F\rstn,oo)>t-T{n)\J^n)) 

x6B(n) 

< C4.i{a,dfF{S{n,d,S,t)) (by (4.8)). 

Lemma 4.3 therefore shows that 

P(#((C(t) - C{{t - S)-)) n B(n))|T(oo) > t) 

(4.9) < C4.i{a,dyc4.2{c(,d-a,d){a''+^ AS){e*F{T{oo) > t))'^ 

< C4_i{a, d)'^C4,2{a, d - a, d){a'^~^^ A 5). 

This proves (a), and (b) follows upon summing over n G 
(c) As in (a) we have 

P(#(C7(t AT(oo))nB(n))) < ^ P(l(T(a;) < t)P(T(First„, oo) > T(a;) - T(n)|.Fn)) 
< J2 P(l(r(a;) < t)c4.i(a,d)^exp(-d(r(a;) -T(n))a-"-^)) (as in (4.8)). 

Integrate out Uq to see that the last summation equals 

C4.M,dr P(exp(-d(T(St(a;)-Tnot(n))«---^)l(T(«Jt(a;) <t)[l^ 

a;eB(n) 

< C4.i(a, rf)^(l - e-*)r(n, 0, rf) 

< C4.i(q;, (i)^C4.2(Q;, (i, (i)(l — e~*) (by Lemma 4.2) 
<C4.3(«,c^)(l-e-*). □ 

Remark 4.5. If a; e (0, 1) and N = [(log l/a;)/(log 1/a)] then 

N oo 

s(a) < |]^(l-a)-^(l + ... + a^-^)-^}exp( ^ a^-(l " a^T') 

1 k=N+l 

N 

< (1 - a)-^ n^'^^)"' exp(x(l - x)-^(l - a)-^). 

k=l 
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Use Stirling's Formula and optimize over x to see there are constants, C4.4, C4.5 > such that 

(4.10) s{a) < C4.4exp (04.5(1 - a)~^) for all a e (0, 1). 
Using (4.10) and Lemma 4.1(b) it follows that 

€4.3(0;, d) = C4.i(q;, (i)^C4.2(a, d — a, d)(l — a)~^ 

(4.11) < cl^d''{d - 1)-^ exp ((2c4.5 + (do/id - 1)) + dlog(d/(d - 1)))/(1 - a)) 
= C4.6{d) exp(c4.7(rf)/(l - a))- 

The Central Limit Theorem in Section 6 will require the following bounds on the con- 
ditioned clusters. 

Theorem 4.6. There is a C4.8(a,(i) > such that 

F{#C{tf\T{oo) >t)< C4.8{a, d) for aU t > 0. 

Proof. (In this proof, we suppress the dependence of €4.^ on (a, d)). Let denote summation 
over a^i, 0^2 in ® for which xi A X2 is distinct from both xi and X2- Then 

(4.12) P(#C(t)2l(T(oo) > t)) < 2P( ^ J] l(a;2 > a;i)l(T(a;2) < t < T(oo))) 

+ P( J]\(T(a;i) V T(a;2) < t < T(oo))). 

If y 7^ let T(y) = inf{T(x) : = 7^ y} and let Z{y) be the a.s. unique vertex in 

B(||/|) — {y} at which this minimum time is attained. Define T(0) = 00 and Z(0) = 0. Note that 
(T'(j/), Z(y)) is I -measurable and T(Z(j/),oo) is f^-measurable (recall from the beginning of 
Section 2 that £y is the "wide-sense" past up to y). In the second summation in (4.12), write 
x\ = V (B i ® yi and X2 = v (B j ® y2 where v = xi /\X2 and i ^ j & {1, . . . ,d} to get 

fC^1{T{xi) V T{x2) < t < T(oo))) 

< P| J]] l{T{v) < t < f(v) + T(Z(v), 00)) 

xP[^ ^ (^^l(T(^;)+T("®^)(2/i)a'"l+^ <t <T(^;) + T(^®^)(oo)al^l+^)) 
X 1(T(^;) + T("®^)(|/2)al"l+' < t < T(^;) + T("®j)(oo)al"l+i)) | }. 

(T^^®*)(-), T(^®-^)(-)), i ^ j, are independent and are jointly independent of £y. The above 
therefore equals 

d{d-l) I Y,^{T{v){oo)<t)¥{T{Z{v),^)>t-f{v)\:F\,{){uj) 

X [P(#C((t-T(z;)(c^))a-|"l-i)l(T(oo) > (t - T(z;)(u;))a-I^l-^))]^ dP(a;) 
d(d - 1) / J] 1(7^(^)(^) < t)P(T(0, 00) > (t - f (^;)(a;))a-l^l) 

X cl3[P(T(oo) >{t- T{v){oj))a-\''\-^)f d¥{oo) . 
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(4.13) 



< 



The time T(0, oo) is equal in law to the minimum of d independent copies of aT{oo). Lemma 4.1(c) 
and the fact that T{\v\) = min{f (v), T(v)} therefore show that iiT{v){uj) < t, then 

P(r(0,oo) > (t-f(^;)(a;))a-l"l)[P(T(cx)) > {t - T{v){u))a-\''\-^)]^ 

<cf+2 exp { - - f{v){oj))+ - 2Q;-l^l-^(t - T{v){u;))} 

<cf+^ exp { - 2a-H-i(t - T(H)(a;))} . 

Substitute this into the RHS of (4.13), thus bounding the second term on the RHS of (4.12) by 
which is equal to 

oo 

d(d-l)ci3C^+¥(J]5(n,2,t,t)) 

n=0 

and hence at most c{a,d)e~* by Lemma 4.3. 

The first term on the right side of (4.12) equals 

CO 

2F{J2 \x\liTix) < t < T(oo))) = 2P(^ ni^{C{t) D B(n))|r(oo) > t)P(r(oo) > t) 

X n=l 

< c'(a, (i)P(T(oo) > t) (by Theorem 4.4(a)). 

Use this and the above bound in (4.12) to conclude 

P(#C(t)2|T(oo) >t)< c'{a,d) + c{a,d){e^¥{T{oo) > t))-^ 

< c'{a,d) + c{a,d) . □ 

5. The Process W 

Recall from Section 2 that Wn = T(spine^, oo)a~'^~^ is the rescaled time to percolate along 
the backbone from generation n to cxo, and W = {Wn : n e In this section we will show 

W is an ergodic Markov chain which stochastically decreases to its unique invariant measure. 
Moreover there is an exponentially fast coupling mechanism for the chain. 

Notation. For each a; e B, set 

W{x) = T{x, oo)a-l^l-^ = a-^T^%{oo) . 

Let F denote the c.d.f. for Wq. Then W{x) is equal in law to Wq for each x E M; since Wq is 
the minimum of d independent copies of T(oo), Lemma 4.1 shows that 

(5.1) 1 - F{t) = F{Wo >t)< C4.i(a, d'fe-'^K 



28 



Hence we can define a law on (0, oo), which "tilts" Wq to the right by a: 

A{A) = P(e"^oi(W^o e A))/F{e'^^°) . 

Write A{t) = A{{0,t)). 

To define the filtration on which the sequence W is Markov, we introduce two more pieces 
of notation. 

Notation. Let Tx = cr(VF(a;)). Then x <y implies Tx C Ty and if a; e B(n), then 

{spine„ = x)= {T{x) + a'^^^W^x) < min{T(a;') + a"+^Vr(a;') : x' e B(n) - {x}}} e JT^. 
Therefore spine^ is an "jFj.-stopping point" and we may define a filtration {Wn} by 

= ^spine„ = {Ae T : An {spine^ = x} e for all x e B}. 
The verification that {Wn} is adapted to {Wn} is immediate. 

Theorem 5.1. (a) W = {Wn : n e Z+} is a (>Vn)-Markov chain such that for Borel sets 
B C (0,oo), 

FiWn+i e B\Wn) ^ p{B\Wn) :^ A{B\{0,Wn/a)) . 
(b) {Wn,n e Z+) is stochastically non-increasing. 

Proof, (a) Let A e Wn, x e B(n + 1) and B a Borel subset of (0, oo). Then 

(5.2) HWn+i e 5, A, spine„+i = x) 

= P 1{A, spine^ = x\n)F{W{x) e B, 

Ux + aW{x) < ^^min_^ t^sibi(a;) + aW{s\bi{x)) | ^x\n) ■ 

Both VF(x|n) and the event in the conditional probability are ^^Q^-measurable. Since 

^{x\n,oo) cind £x\n ^^e independent, while ^x\n = ^x\n V a{W{x\n)), it follows that (5.2) equals 



(5.3) 



1(A, spine^ = x\n)F{W{x) e S, 

Ux + aW{x) < ^mm_^{Us\b.^x) + aW{s\hi{x))) \ W{x\n)) 

Let B(l) = {zi, . . . , Zd}, and set 

q{-\w) = d-F{W{zd)e-,U,, + aW{zd)< min {U,^+aW{zi)) 

l<i<d—l 

l<z<a 
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In other words, q{- \w) is a regular conditional probability for the right side. The collections 
{W{x), U^, {{W{sMx)), C/sib,(x)) ■■i<d)) and {W{zd), U,,, {{W{zi), U,,) : i < d)) are equal in 
law, and 

W{x\n) = mm{Ux + aW{x), Us\\:>^(^x) + o:W{s\bi{x)) : i < d}. 
Therefore, using this in (5.3), we deduce that 

F{Wn+i e B,^,spine^+i = x) = d-^P(l(A, spine^ = x\n)q{B\W{x\n))) 

= d~^P(l(A, spine„ = x\n)q{B\Wr,)). 

Sum over a; e B(n + 1) to conclude 

(5.4) F{Wn+ieB\Wn) = q{B\Wn) a.s. 

If h{y) = P(mini 

<i<d-i{Uzi + o^W{zi)) > y) then clearly for A e B((0,oo)), 
dF(W{zd) e B, min {U,^ + aW{zi)) e A, U,^ + aW{zd) < min {U,^ + aW{zi))) 

l<i<d l<i<d—l 



= dj J l{w e B,u + aw e A)h{u + aw)e '^dudF{w) 
= dj Jl{weB,ye A)l{w < y/a)h{y)e-y+'^'^dF{w)dy 
= I ^{y)A{B\{0,y/a])dy, 

J A 

where ip{y) = dh{y)e~yA{y/a). Take B = (0, oo) to see that ifj is the density of 
m.mi<d{Uz^ + aW{zi)). It follows that A(- \{0,y/a]) is a version of g(- \y) and (a) is a con- 
sequence of (5.4). 

(b) We have 

P(Vri >t) = dF{l{W{zd) > t)¥{Uz, + aW{zd) < min (U^, + aW{zi))\W{zd))) 

l<i<d 

< dF{W{zd) > t)F{Uza + aW{zd) < min (C/^, + aW{zi))) (Lemma 3.5) 
= F{W{zd) > t) (symmetry) 

= nwo>t). 



This proves Wi -< Wq. The result now follows by induction and the fact that p{- \w) is stochas- 
tically non-decreasing in w. □ 

Since Wn decreases stochastically in n, the laws converge weakly, and we shall now show 
that the limit is nontrivial. The other goals for the remainder of this section are to understand 
the stationary distribution for {VFn} (including existence and convergence to stationarity) and 
to prove an exponential rate of convergence through a coupling mechanism. The first goal is 
achieved in the next result. 

Theorem 5.2. (a) W = {Wn : n G Z_|_} converges to a stationary distribution n on (0, oo). 
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(b) TT is the unique stationary distribution for W and the stationary chain is ergodic. 

(c) There are positive constants C5.i(p)(p > 0), and C5.2 such that for all > and for all p > 

^{{0,w])<c,.i{p)wP 
7r(Koo))<C5.2e-'^"'. 

We begin the proof with an integral equation that leads to an important bound (5.9) on 
the left tail of F. Let Fi{t) = P(T(oo) < t) and g{x) = !-(!- x^. Wq is the minimum of d 
independent copies of T{oo) and so 

(5.6) F{t) = g{F,{t)) e [F,{t),dFi{t)]. 
By conditioning on Uq we also have 

(5.7) Fi{t) = ae-^ / e'''F{s)ds < F{t/a){l - e"^). 

Jo 



Equations (5.6) and (5.7), together with the easy fact that Fi{t) > for positive t, imply that 
F and Fi are infinitely differentiable and satisfy 

pt/a 

(5.8) Fi{t) = -ae-* e'''F{s)ds + F{t/a), F'{t) = g'{Fi{t))F[{t), 

Jo 



(5.9) 0<F{t)/F{t/a)<d{l-e-*) <td for aU t > 0. 

Next, we let /3 be the midpoint between 1 and a~^, and show that when Wn is small, the 
probability is at least 1/4 that VFn+i > Wnf3. 

Lemma 5.3. (a) There is a to > such that F{t(3)/ F{t/ a) < | and A{t(3)/A{t/a) < | for 

te{o,to). 

(b) For all < w; < 1, A{w)/A{w/a) < dew. 
Proof. Differentiate in (5.8) to see 

(5.10) F"{t) = 9"{F,{t))Fi{tf+g'{F,{t))Fi'{t), 

and as t J, 0, 

F{'(t) = o{F{t/a)) - F{t/a) + F'{t/a)a-^ 

= o{F{t/a)) - Fit/ a) + g' {Fi{t/ a))F[{t/ a)a-^ 

= o{F{t/a)) - F{t/a) + g\Fi{t/a))a-\o{F{t/a'^)) + F{t/a^)) (from (5.8)) 
= o{F{t/a^)) - F{t/a) + da-^F{t/a^). 
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Use the latter in (5.10) to conclude that as t J, 



F"{t) = o{F{t/a)) + d{o{F{t/a^)) - F{t/a) + da-^F{t/a^)) ((5.8) again) 
= o{F{t/a^)) + (fa-^F(t/a^) - dF{t/a) 
> for small t. 

Therefore F is convex near and for i e (0, to) we have 

{F{t/a) - F{tp))/F{t/a) > \{F{t/a) - F(t))/F{t/a) >\{l- dt) (by (5.9)) 

> I (for to small enough). 

For the second inequality in (a) note that for t G (0, to), 

A{tp)/A{t/a) < F{tp)e^/F{t/a) < |, 

where we have taken to sufficiently small for the last inequality. This proves (a), and (b) is a 
trivial consequence of (5.9), just as above. □ 

Proof of Theorem 5.2. (a), (c). The stochastic monotonicity in Theorem 5.1 shows that (W^) 
converges in distribution to a law tt on [0, oo). We must show 7r({0}) = 0. If to > is as in 
Lemma 5.3, then for w e (0, to A 1], 

<W)< P(W^n+l < W) 

= F{A{{Wr,/a) Aw)A{Wn/a)-^) 

< < /3-^w)+F{l{/3-^w <Wr,< w)A{pWn)A{Wn/a)-^) 
+ F{l{Wn > w)A{w)A{w/a)-^) 

< F(W„ < + l¥{p-^w <Wn<w)+ dewP(Wn > w), 

by Lemma 5.3. Rearrange terms to conclude from the above that 

¥{l3~^w <Wn<w) < Adew = cw for w e (0, to A 1]. 
Iterating the above we find that for k > ko, 

oo 

P(0 <Wn< f5~^) = c(3-^ = c(l - /3-^)-^/3-^ 

j=k 

and therefore 

(5.11) P(0 <Wn<w)< cp{l - P~^)~^w = C5.1W for < -u; < wq. 

Let n — cxD to get the first inequality in (5.5) for p = 1, first for w a continuity point of 7r((0, w]) 
in (0, Wq), and then (increase C5.1 if necessary) for all w > 0. This proves the first part of (c) for 
p = 1 and also shows 7r({0}) = 0. Since F is atomless, so is A, and p{- \w) is weakly continuous 
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in w. Taking limits in P(VF„_|_i G •) = P(p(- IVF^)), we see that tt is a stationary law. The upper 
bound on tt{[w,oo)) is immediate from (5.1) and n < F (again increase C5.1 if need be). 

To prove the upper bound on the left tail of tt for general p we consider p = 2. The induction 
argument which will give the result for general p in M will then be clear. The fact that tt is 
stationary implies that for w G (0, to), 

7a Am;) 
^{[0,w])= / ^" ^ d7r{x) 

< TT {[0,P-^W]) + j 1{(3-^W <X<W) dTT{x) 

< ^ ^ a > + / 1< a < ^> xliy <'^(^>' 

and therefore, 



a • 

- + (de)^ — 
a, a. 



< I TT ((/3-'t(;, to]) + (det(;)c5.i(l)- + (rfe) 

using Lemma 5.3 and the p = \ case. The above implies 

TT ((/?~^w, w]) < 4 ((iec5.i(l) + {def) a'^w 



and the result for p = 2 now follows as for p = 1 in the above. 

(b) (5.8) and (5.9) show that F[{t) > e-^F{t/a) > for all t > 0. This and the second part of 
(5.8) imply F'{t) > and hence A'(t) > for all t > 0. Therefore p{- \w) has a strictly positive 
continous density on {0,w/a]. It follows easily that {W^) is an indecomposable Markov chain 
and (b) is a consequence of (a) and Theorem 7.16 of Breiman (1968). □ 

There is a natural coupling technique for W which, in addition to refining the above 
convergence result, will also play an important role in the limit theorems of Section 6. Let 
D = {(tui, 102) : wi > W2 > 0} and define a Markov kernel on D by (set 0/0 = 0) 



p{Ai X A2\{wi,W2)) = / lAixA2{x,x)dA{x)A{w2/a)~^{A{w2/a)/A{wi/a)) 

Jo 

+ / \AAx\)dA{xx){A{wxloL) - A{w2Iol))-^ 

J W2 I OL 
l'W2/ct 

X / lA2{x2)dA{x2)A{w2/a)-'^{l-{A{w2/a)/A{wi/a))). 
Jo 



li Pi{- \{wi,W2)) is the z*'* marginal oi p{- \{wi,W2)) {i — 1,2), then 
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pi{Ai\{wuW2)) = I U,(a;)cZA(a;)A(wi/a)-^ + / (a;)cZA(a;)A(t(;i/a)-^ 
= p{Ai\wi), 



and similarly one sees that P2(^2|(wi, ^2)) = p{A2\w2)- We extend this coupling of W to a 
Markov kernel on (0, 00)^ by setting 

p{Ai X A2\{wi,W2)) =P{A2 X Ai\{w2-:Wi)) if W2 > Wi. 

Then p is a Markov kernel on (0, cxd)^ with marginals pi{- \ {wi,W2)) — p{- \wi) and so the 
induced Markov chain {{W^,W^),n e Z+) will be a coupling of (Wn). Clearly Wq > Wq 
(respectively, Wq < Wq) implies > (respectively < W^) for all n > a.s., and if 
Tcouple = min{n : = W^} then = for aU n > Tcouple a.s. 

If Wq > Wq, the chains will couple at the first time n for which W^ < W^_i/a and 
hence it is possible for W^ to jump onto W^. Unfortunately if W^_i is small the probability of 
W^ < W^_i/a will be small, and to get a good coupling rate we must bound the time spent by 
W^ in (0, S] for 5 small. We start with a stochastic lower bound on the left tail of the Markov 
kernel of W. 

Lemma 5.4. There is a wq E (0, 1] and a probability p on (0,0!"-'^] such that 

(5.12) p((0, wx] \w) < p((0, x]) for all a; > and w e (0, wq], 

(5.13) J log xdp{x) = m > 0, 

(5.14) p((0, x]) < wodex < X for all a; < a^. 
Proof. Let wi — {2de)~^ and define p on (0, wi] x (0, 1) by 

n-1 

p{w, x)^Y[ {dea^w) = (cZew)^a^(^-^)/^ if a" < x < n G N. 

Then p is non-decreasing in each variable, p < 1/2, p{w, ■) is right-continuous, and 
p{w, 0+) = = p{w, 0). Let to be as in Lemma 5.3, < wq < wi A to, and define 

{p{wo,x) ifa;<l, 
I if 1 < x < /3, 

1 ifp<x<a-\ 

p is the distribution function of a law (also denoted by p) on (0, a~^]. If {x, w) e (0, 1) x (0, wq] 
and n eN satisfies a'^ < x < a^~^ , then 

n-1 



k{xw) < A(q;" ^w) < Y[ {dea''w)A{a ^w) ( by Lemma 5.3(b) ) 

k=0 

= p{w,x)K{w / a) < p{x)K{w / a). 
34 



Therefore 

p{{0,wx]\w) = A{xw) / A{w / a) < p{x) for {x^w) e (0,1) x (Ojtuo]- 
For {x^w) e X (OjWo] we have, from Lemma 5.3(a), 

p{{Q,wx]\w) = A{xw)/A{w/a) < A{f3w) / A{w / a) < f = p{x), 

which proves (5.12). 

If a" < a; < a^~^ for some n > 3, then 

< {dewo)x^''-^'^/^ < {dewo)x, 

which proves (5.14). 
Finally, 

rl/ot ro? 

\ log xdp{x) > / ([ogx){dewo)dx + {\oga'^)p{wo,l-) + {\ogP)/4: 
Jo Jo 

„ 2 

pa 

= (dewo) / {logx)dx + {\oga^)dewo + (log/?)/4, 

which is positive if we choose wq sufficiently small. □ 

For w > 0, let be the law of {Wn, n G Z+) starting at Wq = w, and for {wi, W2) G [0, cxd), 
let Fw'i_,w2 be the law of ((VF^, W^), n e Z_|_) starting at (VFq , ) = (wi, ^2)- If // is a law on 
(0,00) or (0,00)^, then write for the law of the appropriate chain {W or iW^^W"^)) with 
initial distribution p,. 

Lemma 5.5. For all ^, e > 0, there exist (^5.1, C5.3, A5.1 > (depending on (^,e)) such that 

n 

P„(^ l{Wj < 55.1) > en) < C5_3{w-^ + l)e-^5 i" for all n e Z+, > 0. 
i=o 

Proof. Define a Markov kernel q on (0, 00) by 

Lemma 5.4 implies that g(- |w) -< p{- \w) for all w > 0. This, the stochastic monotonicity of 
p{- \w), and a standard coupling argument (Kamae et al. (1977, Theorem 2)) show that for 
each w > we may construct Markov chains (Wn) and (Xn) with transition kernels p and q, 
respectively, on the same probability space such that Xq — Wq — w and Wn > X^ for all n > 0. 
We abuse notation slightly and let P^, denote the underlying probability. Define a sequence of 
stopping times {Tj,j e Z+) by 

To — mm{k > : Xk > Wq} 
Tj_|_i = min{/c > Tj : X^ > Wq}. 
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Let {Vj : j e N} be i.i.d. random variables, on some .F, P), with law p, and set 

Sn = E7=i^°S^J- ^o(«^) = niin{A; > : wlf-^iVj > wq}, w > 0. The definition of 
q implies that 

^' ^ =F{{{logw + SkATi,k>0),T(,{w))e-)ioTal\w>0. 

(5.14) implies ^{9) = P(exp(-^log Vi)) < oo for ^ < 1 and by (5.13) 7'(0) = -P(logFi) < 0. 
Choose < ^ < 1 sufficiently small (0 < ^ < say) so that ^{9) < 1. Then 

P^„(ro > n) = P(logzi; + Sk< log«;o ^k<n) ( by (5.15)) 

(5.16) <n^M-OSn)){wo/wf 

< w-^-i{9Y ( recall wq < 1). 

The strong Markov property of X now shows that Tj < oo for all j > a.s. 
If 5 > define 

To 

fc=Tj_i + l k=0 

For j e N, the strong Markov property of X implies {Gj = a{Xo, . . . , Xj)) 

(5.17) F^Yj e A|6;t,_J = Px(T,_o(Px(i)(lo e A)) 

= J Fa;{Yo e ^)p(da;|t(;o) ( since > tuo). 

Therefore {1^- : j e N} are i.i.d. and 

^w{Yj) = J ¥^{Yo)p{dx\wo) 

» oo 

< J F{J2 U^ogx + Sk< log5))p{dx\wo) ( by (5.15)) 



fc=0 

oo 



(5-18) ^ 

< ^ / P(e-^^^)5^a;-V(f^a;|t(;o) 



(1 - 7(6'))-^ y x-^p{dx\wo)5^ = c{9)5^ 



where c{9) < oo by (5.12) and (5.14). Use (5.17) and (5.16) to show that for sufficiently small 
positive (3 

P„(e/5^0 = / P.(e^^°)p(da;|«;o) <ef j P,(e^^°)p(da;|«;o) 



< oo. 
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(5.18) implies ^P„(e^^O|;3^o < c{e)5^ and so for some (3 = P{9) > we have 

(5.19) ^w{e^^') < 1 + 2c{e)p5^ < exp{2c{e)p5^). 
Given £ > choose 5 > so that c{e)(3S^ < e/8. Then 

n n 

< ^) > ^n) < P«;(^ l{Xj <S)> en) 

n 

< P„(To + l + ^y^ > en) 

j=i 

< P„(To > en/2 - 1) + exp(-/3£n/2)P»(e^^O" 

< t(;-^7(6')[""/2]-i + exp(-/3£n/2 + 2c{e)pS%) 

( by (5.16) and (5.19)) 

< c(t(;-^ + l)e-^" 

for some c = c{9,e) > and A = X{0,e) > ( by the choice of 5). This proves the result for 
<9 <9o, and it follows trivially for all 6' > 0. □ 

Proposition 5.6. For any ^ > there are C4.4(^), A5.2(^) > such that the coupling time 
''"couple satisfies 

^wi,w2ircoup\e >n)< c^aHwi A W2)~'^ + l)e"^5-2" for all n e Z+, {wi,W2) G (0, oo)^. 
In particular, Tcouple < °° Isiws ji on (0, oo)^. 

Proof. Fix Q < W2 < wi, write P for P«,i,«,2, and let = cr{{Wj,Wf)J < n). Note that 
M{n) = l(Tcouple > '"') 6xp(X^jrQ A{Wj/a)) is a (^„)-supermartingale because w.p. 1. 

¥{M{n+l)\gn) = M(n) exp(A(W^„V«))P(Tcouple >n+l\gn) 

= M{n) exp(A(W^/Q;))(l - A(W^/a)/A(W^/a)) (by the definition of p) 
< M{n). 

If 6* > and £ = 1/2 in Lemma 5.5, then that result gives (for 8 = 5^_i) 

n—l n—1 

P(Tcouple >n)< P(Tcouple > n^Y. ^^^f ^ ^) ^ + ^Y. ^ ^) > 

j=0 j=0 

< F{M{n)) exp(-A(5/a)?i/2) + c^.siw^^ + l)e-^^-^'' 

< exp(A(w2/a) - A(Va)n/2) + c^.siw^^ + l)e-^^ i". 



The result follows because A{S/a) > (by (5.9)) and A{w2/a) < 1. 
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□ 



Corollary 5.7. (a) There are Markov chains {Wn,n e Z_|_) with law Pi? and {Wn,n e Z+) 
with law Ptt, defined on the same probability space, such that if 

"^couple = niin{n eZj^ :Wn = Wn} 

then: 

(i) Wn = Wn for all n > Tcouple, 

(ii) P(Tcouple > n) < Cs.se"'^^-^"' for all n G Z_(_ for some C5.5, A5.3 > 0. 

(b) If Wn — T{X\n, oo)a~^~^ and |i2(VF„) — M denotes the total variation distance between the 
law of Wn and its weak limit tt then — 7r| < 2c5.5e~^^-^'^, n>0. 

Proof, (a) Since F tt by Theorem 5.1(b), there is a law n on D = {(^1,^2) : wi > ^2 > 0} 
with first and second marginals F and tt, respectively. Let {Wn, Wn) be a chain with law P^. 
Then by Proposition 5.6 with ^ = | 

(^couple >n)< c^ac-^'-'^'/^^'' J w-'/^ + ldn{w) < cs.se-^^-^" (by (5.5)). 

(b) is immediate because (Wn) has law Pf- □ 
The same argument as in (a) also gives the following corollary. 

Corollary 5.8. There exists C5.6 > such that 

(5.20) Pttxtt (Tcouple > ^) < c^.ee-^' ''' for aU n e Z+ . 

6. Limit Theorems 

In this section we use the results of the previous sections to prove a Law of Large Numbers 
(Theorem 6.1) and a Central Limit Theorem (Theorem 6.2), for functionals of the clusters which 
branch off the backbone. The next section then lists specific limit theorems that follow from 
these. Recall the setting of Theorem 2.5: the backbone is spine^^, spine2, . . ., the clusters off the 
backbone, and associated percolation times to infinity are given by Ynj = (Clust^^j , anj), and are 
conditionally independent with distributions I'Wn- Let Yn denote the vector (Yn 1, . . . ,1^^ d-i) £ 

Cd-l 

Notation. If (p : Sq~'^ R, let 

d-1 



Mv)^ '^{yi,---,yd-i)T\Mdyj), 

/od-l 

/•oo 

fi{ip) = / Hti^)d7T{t). 

Jo 

Thus nt is the law of (d — 1) independent copies of ut- 
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Theorem 6.1. If (f : Sq~^ ^ M is measurable and //(<^) is finite, then 



n-l 



lim n ^ (fiyj) = iJi{if) a.s.. 



n— >oo 

i=o 



Theorem 6.2. Suppose that 

(6.1) / /.,(^^)'log+(/.,(<^^))d7r(t) 

^0 



< oo. 



Then 

n-l 

where 

oo 

(6.2) al = 2(j2^Af^w,{v)f^Wo{'fi) - Hff)) +Hf'') - Piff < oc. 

Proof of Theorem 6.1. Let {Wj,j e Z_|_) and {Wj,j G Z_|_) be the chains with laws Fp and Ptt, 
respectively, which are coupled as in Corollary 5.7 (a). Let {Uj : j e Z+} be an independent 
sequence of i.i.d. random variables which are uniformly distributed on [0,1]. Define Gt{x) = 
• • • : viVi, ■ ■ ■ ,yd-i) < x}) and let Vj{t) = G^^{Uj) where G^^{u) = \n{{x : 

Gt{x) > u}. Therefore {Vj{t) : j G Z_|_} are i.i.d. and have distribution ij,t{ip{-) G •). Moreover 
Vj{t,uj) is jointly measurable because G^^{x) is. Theorem 2.5 shows that iVj{Wj),j G Z_|_) and 
{(p{Y^^^),j G Z+) are equal in law. By the coupling in Corollary 5.7 it suffices to show 

n-l 

(6.3) lim n-^ V Vj{Wj) = /i((/?) a.s. 

3=0 

However, Vj{Wj) = G^ (Uj) and since {(W^j, Uj),j G Z_|_} is clearly stationary and ergodic by 

Theorem 5.2, {Vj{Wj),j G Z_|_} is too. //(<^) is the mean of Vj{Wj) and therefore the ergodic 
theorem implies (6.3). □ 

Proof of Theorem 6.2. Let {{Wj, Wj, Vj) : j G Z+} be as in the previous proof. As in the above 
argument it suffices to show 

n-l 

= n-V2 ^ {y,{W,) - n^)) ^ iV(0, 4). 
Write $n = + Z^, where 

n-l n-l 

Xn = n-^/^Y.i^^^^o)- ^^wM) and Z„ = n-^/2 5^(/i^^.(¥')-MM)- 
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We will use the Lindeberg Central Limit Theorem to show that, conditional on W = (W^), 
Xn ^ N{0, af) and then use a Central Limit Theorem for stationary ergodic processes to 
prove that Z„ A?"(0, (t|). Introduce 



n— 1 n— 1 

j=0 j=0 

The Ergodic Theorem implies 

(6.4) lim s^n~^ = 'p{(p'^) — / fxti'p)'^ d7T{t) = af a.s. 

n— >oo y 

We claim that 

(6.5) lim P(e^^^" |W) = e'^^'^i/^ for all 6* G M. 

n— >oo 

If o"^ = this is clear from (6.4). Assume af > 0, let £ > and check the Lindeberg condition, 
conditional on W. For any fixed K > 0, use (6.4) and the Ergodic Theorem to conclude 

n-1 



limsups-^Y.^ ({Vj{W,) - ^i^^{ip))h{\Vj{W,) - {ip)\ > esn)\w) 

n^oo — ' \ J / 

n—1 „ 

= (^i^J J {fiy) - l^t{<f)Ti{\<fiy) - l^t{<f)\> K)diitiy)d7r{t) a.s. 

The last expression approaches zero as i^T ^ cxd because Jti^p"^) < cxo by (6.1). This gives us 
the Lindeberg condition with respect to P(-|W) a.s., and (6.5) then follows from the Lindeberg 
Central Limit Theorem and (6.4). 

It is easy to use the exponentially fast coupling given by Corollary 5.8 to see that {VFj} 
is strongly mixing with an exponential mixing rate (with the notation of Rio (1995)), a{n) < 
2c5.6exp(— As.an). This means that the same is true of the ergodic process {/i^.(<^) — A*(¥')}, 
and (6.1) allows us to apply a Central Limit Theorem for strongly mixing stationary processes 
(Theorem 1 of Rio (1995) and (1.5) of that work) to conclude that 

(6.6) lim P(e'^^") = e''^^"^/^, 

n— >oo 

where 

/oo 

The required result now follows from (6.5) and (6.6) because a'^ — af + □ 
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Example 6.3. Define ip^ : S^'' ^ [0, oo) by (^i((Ci, ai), . . . , {Cd-i^aa-i)) = Ei=i Then 

^iM) = {d-l) j{#Cfdvt + {d-l){d-2){j #Cdvtf 

< (d- l)c4.8(tt,rf) + (rf- l)((i- 2)c4.3(ci,(i)^ (Theorems 4.4 and 4.6). 
If Cn = U^Ji Clustn,i, then <^i(F(")) = #Cn, and Theorems 6.1 and 6.2 show that 

n— 1 „ „ 

(6.7) hm V #C,- = /i = (d - 1) / / H^Cdutdirit) a.s., 
and 

n-1 

(6.8) n-i/2^(#C',_Ai) ^iV(0,aJJ as n ^ oo, 

where a (not very helpful) expression for cr^^ > may be retrieved from Theorem 6.2. The 
expression for Ji is more tractable and Theorem 4.4(b) and (4.11) imply that 

(6.9) Q<Ji = fi{a, d) <{d- 1) sup j i^Cdut 

<{d- l)c4.3(a,d) 

< C6.i(c^)exp(c4.7(c^)/(1 - a))- 

We now consider some processes which describe aspects of the growth dynamics of {A^j}. 
The size of the cluster at the first time a node of height n is filled is nninsize(n) = min{fc : 
h{Ak) = n}. The size of the cluster at the last time a node of height n or less is added is 
maxsize(n) = max{/c : Ak — Ak-i E M{0,n)}. Let size(n) = #(^oo nB(0,n)) be the number of 
nodes in Aqq of height n or less. Then clearly 

(6.10) minsize(n) < size(n) < maxsize(n). 

Note that each point in Ao^ nB(0, n) is in a cluster Clustfc^i for some A; < n or is one of the first 
n+1 vertices along the backbone. If N{n) = Ylk=o #(^fc)) this gives 

(6.11) size(n) < iV(n) +n+ 1. 

For the next result is defined in Theorem 6.2 with (pi as in Example 6.3, and we define 

(6.12) rate = fi{a,d) + 1 > 1. 

Theorem 6.4. For {L{n)} = {size(n)}, {maxsize(n)} or {minsize(n)} the following results hold, 
(a) lim n~^L{n) = rate a.s. 

n— >oo 
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(b) n-^/^{L{n) - mate) N{0, a^J as n ^ oo. 

It is not hard to prove this by first using the bounds in Theorem 4.4 to show that maxsize(?i) — 
minsize(n) and N{n)+n+l—s\ze{n) remain bounded in probabihty as n — > oo, and then applying 
Example 6.3. The interested reader may find this argument in Lemma 6.5 of the earlier version 
of this work referred to in the introduction. We will prove Theorem 6.4 in the next Section by 
showing that the above differences are bounded in by means of a dynamical decomposition 
of Aoo into independent blocks. 

Let £{n) = min{|x| : x G — A^} be the height of the shortest vertex which is added 
after step n. Hence A^ fl B(0, £(n) — 1) is "frozen" for k > n. Clearly £(maxsize(/c) — I) < k < 
£(maxsize(/c)) and so 

maxsize(A;) = min{n : £{n) > k}. 

It is a simple matter to read off limit theorems for h{An) and £{n) from the corresponding results 
for their inverses, minsize and maxsize, respectively. Let no = iiQ{a,d) = rate"-^. 

Corollary 6.5. (a) lim n~^h(An) = lim n~^£(n) = /iq a.s. 

n— >oo n^oo 

(b)(i) n-^/^{h{An) - njjo) iV(0, (yl^^4) as n ^ oo. 
(ii) n~^/'^{£{n) - n^o) N{0, a'^^nl) as n ^ oo. 

Proof, (a) is a trivial consequence of Theorem 6.4 (a) with L{n) = minsize(n) or maxsize(?i). 

(b) Let H{n) = n-^/^{h{An) - nfio) and E{k) = k~^/^{m\ns\ze{k) - kjjQ^). Fix a; e E and set 
k{n) — [nno + Xy/n] + 1 where [z] is the greatest integer not exceeding z. If 
Xn — {'IT' ~ k{n) iiQ^)k{n)~^/'^ , then for n large enough so that k{n) G N, 

P{Hn <x) = P{h{An) < k{n)) = P(minsize(/c(n)) > n) = P(S(/c(n)) > Xn). 

3/2 

Since limx„ = — //g x, Theorem 6.4(b) with L{k) — (j{k) implies that 

lim P(Hn <x) = P{Z > -iiq^^^x) = P{Z < iiq^^^x), 

n— >oo 

where Z is a A^(0, cr^J random variable, (i) follows and a similar argument proves (ii). □ 

It is easy to translate these a.s. limit theorems as n ^ oo into continuous time results as 
t t Too- Note that 

Wk+i < (T^ -T(^))«-'=-i < W{f\rstk), 

where Firsts is the first vertex in B(A;) which is added to the cluster. Note that VF(Firstfe) is equal 
in law to Wq, and Wk+i >- tt for all k by Theorem 5.1(b). Use the above with the estimates 
(5.1) and (5.5) (for the left-hand tail of tt) and a Borel-Cantelli argument to see that 

(A; - l)-^a''-^ < Too - T^^^ < (logA;)a^ for large k a.s. 
By considering T^'^) < t < T^'^+i) this gives 

(6.13) h{Ct)~^a^^^'^ <Too-t< log{h{Ct))a''^^'^ for < T^o - t sufficiently small, a.s. 
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Theorem 6.6. (a) lim {h{C{t)){log^{T^ - t))'^ = 1 a.s. 
(b) lim #(C(t))(logjT^ - t))-' = rate a.s. 

Proof, (a) is immediate from (6.13). Corollary 6.5 (a) and a trivial interpolation (recall An = 
C(cardn)) shows that limtfT^ h{C (t)) / (t) = /iq a.s. (b) follows from this and (a). □ 

Theorem 6.7. (a) lim n~^(logQ,(Too — cardn)) = A*o a-S- 

n — >oo 

(b) n-^/^(log^(Too - cardn) - n/io) ^ iV(0, crj^/ig) as n ^ oo. 

Proof. For (a) set t = card^ in Theorem 6.6(b). (b) follows from Corollary 6.5(b) (i) and 
(6.14) lim n"^/^(/i(C(cardn)) - log^(Too - cardn)) = a.s. 

n^oo 

(6.14) is an easy consequence of (6.13) and the trivial bound h{C{cardn)) <n. □ 

7. A Decomposition of the Infinite Cluster into i.i.d. Blocks. 

The drawback of the decompositions of into clusters off the backbone (Theorem 2.5) 
is that it is not a dynamical decomposition. This means that some additional work is needed 
before the results on the growth rate of h{An) such as Corollary 6.5 can be derived from the 
limit theorems for the clusters (Clustn,i) given in Example 6.3. We now establish a dynamical 
decomposition of A^o into i.i.d. pieces, which will lead to a proof of Theorem 6.4, and will also 
be used in Section 8 to establish properties of the shape of the cluster "as viewed from the tip" . 

Let Cn = cj{Yk^i, Wk : k < n,i < d) y a{l{Wn < ^o)) V cr(spine^). Here 5q is a sufficiently 
small positive number whose precise value will be prescribed below. Let = V aiWn)- 
Clearly both (C) and (C.) are filtrations. 

Let Mn = min{an,i(a;) : x e C\ustn,i,i < d} and recall Cn = Ui<dClustn,i. Inductively 
define regeneration times {Rj : j E Z+} hy Rq = and 

Rj+i = min{A; > Rj : min Mna"- > Soa^ , Cn C B(0, k-n-2) 

Rj <n<k 

for all n e [Rj, k), Wk-i > aSo, Wk < Sq}. 

Here Sq > will be chosen below. Clearly Rj is a (Cn)-stopping time for all j G The 
first two and last conditions in the above inductive definition will imply that all the points 
in Aoo n B(0, Rj) are added to the cluster before all the points in A^q n B{Rj, oo) and that 
{spine^^. } is the only point in A^o of generation Rj. The next to last condition will ensure the 
independence of the blocks B{j), defined by 

B{j) = A^nM{Rj,Rj+^-l). 

Our goal is to prove the following theorem (for a sufficiently small 6o > 0). 
Theorem 7.1. The sequence of (Cn)-stopping times, {Rj : j E Z_|_}, is a.s. finite and satisfies 
(a) Aoo r\M{Rj) — {spine^^} and nninsize(i?j) = maxsize(i?j) for all j G Z_|_. 
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(b) P((#S(j))') < C7.i(a, d) for all j e Z+. 

(c) By (a) we may define Dj e Sq by 

B{j) = {(spine^.) ®x:xe Dj}, j e Z+. 

Also let T{j; x) = T(spinejj^ , (spine^^. ) ®x)a~^^ for x e Dj. Then {(D^ , T(j)) : j e Z+} are 
independent »So-valued random vectors and are identically distributed for j >1. Moreover 
for j > 1, 

F{{Dj,T{j)) e • \Cr,) = f" Pi{Do,4%) e-\Wo = w)dA{w)A{[0,do])-\ 

Jo 

Remark. We abuse notation slightly, and extend the definition of T{j; x) to all a; e B. Clearly 
T(0; x) = T^%{x) for all xeM. 

Lemma 7.2. Cn C Wn for all n e Z+. 

Proof. By Theorem 5.1 Wn is Wn-measurable, and spine„ is trivially Wn-measurable. Fix k < n, 
i < d, and a;, x' in B. Then for a Borel set B 

{0'k,i{x') e B, spine^ = = {VF(a;|A;) - T{x\k, ei{x\k + 1) © a;')Q;~^~^ e S, spine^ = x} G /a;- 

Therefore Ife^j is Wn-measurable. □ 

Proof of Theorem 7.1(a). Assume Rj < oo for a fixed j G N. If n < Rj, i < d and a; e Clust^^j, 
then 

(7.1) T(e^ © x) = r(oo) - a''+^a„,i(x) < T(oo) - aS^a^^ 

< T(oo) - a^'^+^Wn^ = ^(spine^.). 

Note we have used the fact that if Rj < oo and j G N, then M^a'^ > 5oa^^ for all n < Rj (and 
not just Rj-i < n < Rj). Similarly we have Cn C B(0, i?j — n — 2) for all n < Rj and this 
clearly implies that all the clusters which break off the backbone before Rj are in B(0, Rj — 1) 
and therefore 

(7.2) T{Rj) = T(spine^.), minsize(i2,) = #C(T(spine^.)). 

By (7.1) no new points are added to the first Rj clusters (Co, . . . Cr^-i) after time T(spine^^.). 
Hence spine^^. is the last node in B(0, Rj) added to Aqq- This means 

(7.3) maxsize(i?j) = #C(T(spine^.)). 

Clearly A^o fl B(i?j) = {spine^^ } since spine^^ is the last point added to A^o in B(0, i?j) and 
the first point added to Aoo in M{Rj). (7.2) and (7.3) also give minsize(i?j) = maxsize(i?j) and 
hence (a) is proved once we show Rj < oo for all j G Z+. 

The proof of Theorem 7.1(b) requires several preparatory lemmas. 
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Lemma 7.3. (a) utiT^in^^c a{x) < 5) < C4.3(log+(l/(5) + 1){5 A 1) = g{5) for all t, 5>0. 
(b) There exists e (a, 1) such that utiC ^ M{0,j - 1)) < for all j eZ+,t> 0. 
Proof, (a) We have 

z^tfrninafa;) < (5) < P( max T{x) > t - d\T{oo) > t) 
x€C xeC{t) 

<¥mC{t)-C{{t-S)-))\T{oo)>t) 

< g{5) (by Theorem 4.4(b)). 

(b) Using Theorem 4.4(a) we have 

ut{C t B(0, J - 1)) < P(5^ #(C(t) nB(n))|T(oo) > t) 

n>j 

<^cliC4.2«"+l=C4V + ', 
n>3 

while 

vt{C = 0) = ¥{Uq > t\T(oo) >t) = (e*P(T(oo) > t))-^ > cl\ (by Lemma 4.1). 

The result follows trivially from the above two inequalities. □ 

Lemma 7.4. If < 5o < 55.i(l/2, 1/2) {65,1 as in Lemma 5.5) there exist 07.2,07.3 > such 
that Nm = l{Wk-i > a6o, Wk < Sq) satisfies 

F{ex.p{-Nm)\Wo) < 07.2(^^0"^/^ + l)e-'^^-3"^ for aU m e N. 

Proof. For 60 as above and a fixed > let Mm — Z^fcLi ^iWk-i > cxdo){q — l(Wk < Sq)). 
Then 

P(e'^-|>V^_i) = exp(M^_i + ql{Wm-i > aSo)) 

X [eM-HWm-i > a6o))A{[0,6o AWm-i/a)])AmWm-i/a])-^ 
+ A((5o, W^-i/«])A([0, Wm-i/a))-'] (Theorem 5.1) 
< expiMm-i)[l{Wm-i < ado) 

+ l{Wm-i > a5o)e^(e-^A([0, So]) + A((5o, oo)))A([0, 00))"^]. 

As A((0, 5o]) > (see (5.9)), we may choose q = q{5o) > small enough such that e^"" is a 
supermartingale. If N!^ = Yll^=i ^O^k < ^Sq), then 

e^"^ = exp(5m - qN^^ - Nm) > liN;^ < m/2) exp(5m/2 - Nm) 

and therefore 

P(exp(-iV^)|iyo) < ^N;^ > m/2|iyo) +e-'^"'/2p(e^'"|Wo) 
< C5.3(Vro"'/^ + l)e-^^ i'" + e-'^'^/^. 

In the last line we used Lemma 5.5 and the supermartingale property of e^^'". □ 
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Lemma 7.5. F{Wr. e B\Cr.) = A{B\ [0,6o]) a.s. on {Rj < 00} for all j e N. 
Proof. li A e Cn we claim that 

(7.4) F{A\W) = nA\Wo, . . . , Wn-i, l{Wn < So)). 
To see this consider 

n-l 

^ = n n e Ak,i} f]{sp\ne^ = x}n {(Wo, . . . , Wn-i) eB,Wne D} 

k=0 i<d 

where D = [0,6q] or (5o,oo). (7.4) is then an easy consequence of Theorem 2.5 and the inde- 
pendence of spine^ and W (which holds by symmetry). (7.4) follows for general A G by a 
monotone class argument. 

Assume ^ e is a subset of {Wn-i > a6o, Wn < Sq}. Then 

F{A,WneB)^ jF{A\Wo,...,Wn-i,l{Wr,<do))liWr,eB)dF (by (7.4)) 

= j P(^| Wn_i, l{Wn < So))¥{Wn G 5| W„_i, l(W„ < 5o))dF, 

by the Markov property of W. Use the form of the transition kernel found in Theorem 5.1 to 
see that 

P(Wn e B\Wn-l, l{Wn < 5o)) = A{B\[0, min(5o, Wn-l / a)])l{Wr, < 60) 

+ A{B\{So, Wn-i/a])l{Wn > So, Wn-i/a > So). 

Therefore we have (by our assumption on A) 

(7.5) nA,Wn ^B) = J F{A\Wn-i,liWn < So))AiB\[0,So])dF 

= F{A)A{B\[0,So]). 

Let j eN, A e Cr. and An = An {Rj = n} e Cn- Clearly An C {Wn-i > aSo, Wn < So} and 
so by (7.5) 

00 

P(^, Rj < 00, Wr. eB) = J2 P(^n, Wn e B) 

n=l 

= F{A,Rj <oo)A{B\[0,So]). □ 
The next result is an easy consequence of Theorem 2.5. 
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Lemma 7.6. Let K be a (C„)-stopping time (possibly infinite). Then conditional on 

Ck V (7(W) and on {K < oo}, {YK+n,i '• n £ < d} are independent »So-valued random 

vectors such that 

HYK+n,i e A\Ck V (7(W)) = i^Wk+M) a.s. on {K < oo}. 
Proof of Theorem 7.1(b). Choose 6 < Sq < (^5.i(|, |) A 1 small enough so that 

oo oo 

(7.6) p,{5o) = 1 - l[{l-g{5oa^)y-' + 1 - JJll - P')"'' < 1- 

i=i j=i 
Here and /3 are as in Lemma 7.3. Choose no G N sufficiently large so that if 
h{5o) = C4. 3^0(1 + log 1/(^0 + log I/ck), then (recall P > a) 

(7.7) {d - l)h{5o)ja^ < (3^ for 3 > no and 7 = /^^HpU'^o) - f3''°)d{l - (3)-^ +Pi(5o) < L 

Fix j e and let /c(0) = Rj, and /c(0) < k{l) < . . . < k{n) < . . . denote the successive times 
for which Wk(n) < <^o and Wk(n)-i > ck<^o (n e N). If Rj = 00 set k{n) = 00 for all n. Each k{n) 
is a (Cn)-stopping time. Lemma 7.4 and the strong Markov property of W show that each k{n) 
is finite if Rj is. Let -ft'(?') = k{inQ) and define 

= {niin{M„Q!" ■.Rj<n< K{i)} < 5oa^^^}, 
Di = {Cn t B(0,iC(i) - n - 2) for some n in [Rj, K(z))}, 

^Ar = a'Ii(SiUA). 

Note that Rj-\-\ — Rj > m and K{N) < m + Rj together imply (since each K{i) ioi i < N 
must violate one of the defining conditions for Rj+i). If 

m 

< = 5] nWR^+k < So, WR^+k-i > aSo), 

k=l 

this easily gives for e > 

(7.8) {Rj+i - Rj >m}c {N^ < em} U Ay,rn/no] ■ 

If a; e An and Rjiui) < 00, then u G Bn U Dn, and so for some n in [Rj, K{N)), either Cn is 
not contained in 1(0, K{N) - n - 2) or M^a^ < ^qck^^^). Choose i = i{Lo) e {1, . . . , A^} such 
that K{i — 1) < n < K{i) and note that oj e ^i-i as well (a; e for all j < N). This shows 
that on {Rj < 00}, 

¥{An\Cr, V a{W)) 

N 

< J2HAi-i n {min(Mna" : K{i-1) <n< K{i)) < 5oa^^^^}\CR. V ct(W)) 

+ P(^i_i n {Cn t B(0, K(iV) - n - 2) for some n e [K{i - 1), i^(i))}|Cii. V ct(W)) 

AT 

= VP(1(^,_i){1 - TT v^S^^T^aix) > 5oa^(^)-^)'^-i + 1 

1=1 n=K{i-l) 
K{i)-1 

- n ^wAC cH0,K{N)-n-2)f-^}\CR.ya{W)) (by Lemma 7.6) 

n=K{i-l) 
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< y,f{a_,\cr. V a(w)){i - n (1 -^(5o«''^'^^""))'"' 

i=l n=K{i-l) 
K{t)-1 

+ 1- Yl {I - p^(^)-^)d-n (Lemma 7.3) 

n=K(i-l) 

<nAN-i\CR.V a{W))pi{So) 

N-l 

+ J2 P(^i-i|Cfl, Va(W))(d- 1)( Yl ^(<^o«''^'^^-") 

i=l n=K{i-l) 

N-l oo 

< nAN-i\CR^ V a(W))pi(5o) + 5^ P(A^-l|Ci^, V a(W)) J] 

i=l m—no{N-i)+l 

In the last line we used the first part of (7.7) and the inequality K{N) — K{i) > {N — i)no. A 
simple induction argument using the above and the definition of 7 in (7.7) gives 

(7.9) F{An\Cr^ V a(W)) < 7^ for all TV e N a.s. on {Rj < 00}. 

Use (7.8) with e — C7.3/2 and the strong Markov property of W with respect to (C^) (recall 
Lemma 7.2 and Theorem 5.1) to see that if m e N (recall P„ is the law of W starting at w) 

F{Rj+i-R, > m\CRj 

< P« < em\CR^) + P(A[,^/„„]|CfiJ 

<exp(£m)P;^(fl^.)(e-^-) +7-^(7'/"°)"^ (by (7.9)) 

< c{W{Rj)-^/^ + 1)6-^'"^ (by Lemma 7.4 and the choice of s). 

Now condition on Cr., use F{t) < dt (see (5.9)) if j = 0, and this together with Lemma 7.5 if 
j > 1, to derive 

(7.10) ^{Rj+i - Rj > m\CR.) < C7Ae~''^-^"' for all j e Z+, m e N. 
Let 

n-l 

m; = l{Rj < i < i?,+i)(#a -id- l)iywMC)), n e Z+. 

i=0 

Lemma 7.6 shows (M^, Vcr(W)) is a martingale and Theorems 2.5 and 4.6, and (7.10) readily 
show it is L^-bounded. By (a) each point in B{j) either belongs to a cluster which branched off 
the backbone at generation i G [Rj,Rj-^-i) or to the backbone itself. Therefore 

mimf) = p((E ^ ' < + 1))') 



< 



mM'oof) + p((E ^ ^ < Rj+^)i^w. mc)){d - 1) + i)f) 



< 2P((M^)2) + cP((i?j+i - Rjf) (Theorem 4.4) 

< C7.6 

by the above and (7.10). This proves (b), and also shows that Rj < 00 for all j a.s. 
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For (c), one more lemma is required. 

Lemma 7.7. For all j eN and measurable A e [0, oo)" we have 

P((T(j; x),xeM)e A\CrJ = / ° n{To{x), xeM)eA\Wo = w)dA{w)A{[0, 5o])-\ 

Jo 

Proof. Let p{{Ux, x eM — {0}) G ■ \w) be a regular conditional probability for (Ux, x e B — {0}) 
given Wo = w. Let U {j; x) — t/'((spine^^. ) (Bx) and choose F C B— {0} finite and B a measurable 
subset of [0,oo)^. We will prove 

(7.11) F{{Uij;x),xe F) e B\yVR.)iuj) = pi{U,,x e F) e B\WR.iuj)) a.s. 

Since T{j;x) — X]o/j/<x 2^)'^'^'' '^^'^ condition on Cr^ C Wi^^ and use Lemma 7.5 to 
obtain the desired result. Turning to (7.11), note that for A e y^R. and xq G B, 

(7.12) P((C/(j;x),xeF) eS,A,spine^. = o^o) = P((t/^oea., ^ e i^) e A, spine^. = xq), 

that An{spinefl^. = xq\ is in J'a;;, = E^^M a{yV{xQ)), and that {(t/^o©a;,a; e F) e S} e J^(a;o,oo)- 
The independence of Ex^ and .F(a;o,oo) and the inclusion (j{W{xq)) C .F(a;o,oo) therefore shows 
that (7.12) equals 

P(P((C/a.o®a;,x e F) e S|W(a;o)) l(A,spine^. = xq)) 

= ^{piiUx^x e F) e S|Wii,) l(A,spine^. = a;o)). 

Sum over xq to obtain (7.11). □ 

Since W and (l^n,i) are measurable functions of T'not(')' ^^^^^ ^ measurable map 
r : [0, oo)" — > N such that R\ = r(T^Q^). It is straightforward to check that 
Rj+i - Rj = r(T(j; •)) for all j e Z+ (use (a)). Define $ : [0, oo)" Sq by 

*rfot(-)) = (^rfot(-)),7^St(-)l^), where 

S^(t(°\(-)) = e B : T(°\(a;) < Jim inf{T^°\(2/) ■ Ivl = n}, \x\ < r(T(°\)}. 

Then {Dj,T{j;-)) = $(T(j; •)) for all j e Z+ and so Lemma 7.7 shows that if ^ C 5o is 
measurable, then 

niDj,T{j; .)) e ^|Ch,) = r°P($(TSt) e ^|Wo = «;)dA(«;)A([0,(5o])-^ 

= / °P((F»o,TSt) e ^|W^o = w)dAiw)Ai[0,5o])-'. 
Jo 
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To complete the proof of Theorem 7.1(c) it suffices to show that {Dj,T{j; ■)) is Cfl^+i- 
measurable for aU j e Z+. This reduces to showing that for a fixed a; e B, 

T(°\((spine^.) © x)l(r(°\((spine^^) (B x) < aWo, \x\ < Rj+i - Rj) 

(7.13) is Cit^._^^-measurable. 

If n e Z_|_ and xq G B satisfy n + |a;| < |a;o| then on {spine^_^.^^ = xo,Rj — n} we consider the 
foUowing two cases: 

Case 1. (a^ol'^) ® x < xq. 

Then (spine^^.) ® x — xo\{n + \x\), T|^Q',.(spine^^. Q) x) = aWo — a'^'^^^^Wn+\x\, and so 

{^not((sP'"^i?, ) ®x) eB, T^°\((spinefl.) ® x) < aWo, spine^^^^ = xq, Rj = n} 

= {aWo - a'^+l^'l G B, Rj+i = |a;o|, spine|^^| = xq, Rj = n} 

e C^xo\ (because n + |a;| < |a;o|). 

Case 2. Case 1 fails. 

Then there exists m e [n, \xo\ — 1) fl Z+, i < d and x' in B (depending on (x, xq)) such that 
(spine^^) (B X = ei{xo\m + 1) ® x'. Now T^q*,. (spine j^^) ® x) < aWo if and only if x' e Cm,i in 
which case T^Q',.((spine^^. ) ® x) = aWo — a'^'^^am,i{x'). Therefore 

{^not((sP'"^fl,- ®a;) e 5,T^°\((spinejj^. ® x) < aWo, spine^.^^ = xo,Rj = n} 

= {aWo - a'^'^'^am,iix') e B,x' e Cm,i, sp\ne\^^\ = xq, Rj+i = \xo\,Rj = n} 
e C\xq\ ( because n < m < \xo\). 

Taking the union over n < k — \x\ and |a;o| = A; in the above cases we have 

{T^°\((spineH.) ® x) e S,T('Jt((spine^. © a;) < aWo, 

for |a;| < Rj+i — Rj, Rj+i = k} & Ck 

and so (7.13) follows. □ 

We will now use Theorem 7.1 to complete the proof of the main limit theorem, Theorem 
6.4. Let I{n) = [Rk-i,Rk) n Z+ iff Rk-i < n < Rk and let 

oo 

Zn = J2#Cjl{j el{n)),nez+, 

3=0 

be the size of the "regeneration block" spanning generation n. Theorem 7.1(a) implies that 

(7.14) maxsize(n) < nninsize(n) + 
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and (recall that N{n) = Y2=l #(Cfe)) 

(7.15) 7V(n) + n + 1 < minsize(n) + Z, 



Lemma 7.8. The sequence {Z^} is bounded in L^. 

Proof. It follows easily from the exponential estimate (7.10) on the tail of — Rk-i and the 
Renewal Theorem (see (4.16) in Ch. XI of Feller (1971)) that 

(7.16) supP ((#/(n))^) < oo for aU q > 0. 

n 

Note that Theorems 2.5 and 4.4 together with the definition of Cj imply 

(7.17) P(#C,|C,)<(d-l)c4.3, 
and that 

oo oo 

i=o fe=l 

Take means in the above to conclude that ii p > 1, 

n — 1 oo 

j=0 fc=l 

oo oo 

+ ^ ^P(l(i?fc-i <n<j< i?fe)P(#C,|C,)) 

j=n k=l 

n — l oo 

< 5^p((#C,)(#/(j))'')(n-jr" + (rf- 1)C4.3P($^1(J e I{n) 



j=0 j=n 

< J]P((#C,)2)^/¥(#/(jf") (n-j)-^ + (^-lK3P(#/(n)). 

i=o 



Use (7.16) and Theorems 2.5 and 4.6 to see that the final expression above is uniformly bounded 
in n. □ 

Proof of Theorem 6.4- For each choice of L{n), writing 

L(n) = N{n) + n + l + An, 

the inequalities (6.10), (6.11), (7.14) and (7.15) imply that |A„| < and so is bounded in 
by the previous Lemma. The Borel-Cantelli Lemma implies n~^A„2 — > a.s. and so Example 
6.3 gives a.s. convergence along the subsequence {n^}. A standard interpolation argument 
completes the proof of (a). Since n~^/'^An — » in probability, (b) is now immediate from the 
Central Limit Theorem for N[n) (Example 6.3). □ 
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8. Some Concluding Remarks 

While equations (4.1) and (4.2) specify the law of T(oo) (see the Remark following 
Lemma 4.1), precise estimates on its distribution seem quite difficult to obtain. It is however 
possible to derive some asymptotic results as a t 1. Our starting point is the following 

Proposition 8.1. (Kingman (1975)). If a = 1 then n~^T(n) — > C8_i{d) a.s. as n — > oo, where 
c = C8.i{d) is the unique root in (0, 1) of dce^~'^ — 1 = 0. 

As a G (0, 1) will vary in the following, we will use notation such as Tq,(x) or Tq,(oo) to 
denote dependence on a. Bear in mind that the times Ta{x) are all defined on a common 
probability space as sums of the same variables Ux with different weights. 

Theorem 8.2. (1 — a)Tc,{oo) cs,i{d) as a t 1 a-s. and in L^. 

Proof. Fix e > 0, and use Proposition 8.1 to choose K{e,u)) such that n~^Ti{n) > cs.i — s for 
n > K{e,u). If a; G B, then 

T«(0, x)^Yl ^K) - Ti{0,x\i - 1)) 

i=l 

\x\ 

> J2 («'-«'+')ri(0,a;|i) + al^l+^Ti(0,x) 

i=K{s) 

\x\ 

>(c8.i— £) ^ {1 — a)ia'^ 

i=K{s) 

\x\-K{e) 

> {csA - e){l - a)a^^'^ J] ia\ 
Take the minimum over = M and let M — > oo to see that 

(8.1) liminf(l - a)Ta(oo) > C8.i{d) a.s. 

For the other direction, note that Ti{n)/n is uniformly L^-bounded (being stochastically 
smaller than the average of n i.i.d. exponentials) and so the convergence in Proposition 8.1 holds 
in L} as well. Choose large enough so that P(Ti(A")) < (cg.i + e)N . Now choose random 
vertices inductively generations apart so that = and x/c+i minimizes Ti^Xk^Xk+i) 
among all descendants of Xk- The times Ti{xk^ Xk+i) = Ti{xk+i) — Ti{xk) will be i.i.d. Use the 
crude bound 

CO 

Ta{oo) < y^Q!^^ri(a;fc,Xfc+i) 

fe=0 

together with summation by parts and the Strong Law (as above) to see that 
limsupQ,-[-i(l — q;)Tq,(oo) < cg.i + s almost surely, and that 

CO 

(8.2) P[(l - «)T«(oo)] < ^(1 - a)a'=^(c8.i + e)N ^ cg.i + s 

k=Q 
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as CK t 1 for fixed N. The first conclusion and (8.1) give the required almost sure convergence. 
Fatou's Lemma and (8.2) then show that the mean value of (1 — q;)Tq, (oo) approaches cg.i as 
q; t 1. Convergence in now follows. □ 

Remark 8.3. Recall from Lemma 4.1 that e*P(TQ,(oo) > t) increases to a finite limit 04.1(0:, d) 
as t ^ 00. It is possible to show that 

(8.3) lim(l — a) log 04.1(0;, d) — cs.i{d). 

a'[l 

The lower bound is an easy consequence of (8.1) and the monotonicity of e*P(TQ,(oo) > t) in 
t. The upper bound is more involved and we will not give a proof. It uses the anticipating 
equation (4.2). Equation (8.3) shows that the bound on C4.1 in Lemma 4.1 is far from optimal. 
For example if (i = 2, then cs.i{d) ~ .23 and (8.3) implies 04.1(0, 2) < exp((.23 + £)/(l — o)) for 
o close to 1; Lemma 4.1 gives the same kind of bound but with .23 + e replaced by log 2 pa .69. 

We conclude this paper by mentioning an associated particle system. Set 

ut{n) = #{dCt n B(n)), t > 0, n e Z+. 

The "particles" (i.e. sites in B on the boundary of the cluster Ct) evolve independently: each 
w-particle at a site n G dies at rate o"^, and is replaced by d particles at n + 1. The process 
U = {ut{.) : t > Qi) captures the essential features of the DLA processes Ct and An. only the 
labels of the branches are lost. For various limit theorems on the process U in the case o > 1 
see Aldous and Shields (1988). 

To study the evolving cluster for o < 1 it is more helpful to consider the following modifi- 
cation of U . Define a random time change at so that C^^ always adds neighbours to its deepest 
vertices at a constant rate: 

Mt = max{n : ut{n) > 0}, 

Lt= [ a~^'ds, 0<t< T(oo), 
Jo 

at = inf{s : Ls > t}, 
Vt{n) = u^^{M^^-n), t>0,neZ+, 

where ut{n) is taken to be zero for n < 0. At each time t, Vt is a function on Z_|_ counting how 
many vertices are in dCat at each level below the highest one. Note that < 00 if t < T(oo), 
and that lim^-|-T(oo) Lt = 00; thus at < T{oo) for all t > 0. Straightforward calculations show 
that the process V = (Vt(.) : t > 0) evolves as follows: 

(i) Particles at site n, n > 1, die at rate and are replaced by d particles at site n — 1. 

(ii) Particles at die at rate 1, and are replaced by d particles at site —1. The whole configu- 
ration is then immediately shifted to the right by 1 step. 

We call V the "tip process": it describes the form of the cluster when viewed backwards 
from the tip. The size of the process V near arises from the interaction of two effects: first the 
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strongly supercritical branching (at an accelerating rate as particles approach 0), and secondly 
the right shifts, which move particles away from 0, and so slow down their branching. Since 
the process V is a functional of C, it should be possible to deduce many properties of V from 
our results on C. However, just as some work was needed to obtain results such as Corollary 
6.5 (giving the growth rate of A^) from the cluster decomposition, so also passing from C to V 
is not completely straightforward. A further study of V may be the subject of a future paper: 
here we will just give a sketch proof that V is (in a certain sense) recurrent. 

For / : Z+ ^ M and A e (0, 1), set = E"=o A"|/(n)|. 

Theorem 8.4. Let A e (0, 1). Then the process HV^Ha is recurrent in the sense that there exists 
C8.2 < oo such that 

{t : \ \Vt\\x < C8.2} is unbounded. 
Proof. Recall from Section 7 the definition of the regeneration times Rj, and set 

e, = #(s(j-i)), 

Sj = inf{t > : M^, = 
It is clear that Sj < 00 for all j and that limj_»oo Sj = 00. Then 

ll^s,||A = X;A«^--«.,^(n) 

n=0 

3 3 

= E E 1(^^-1 ^ ^ < ^OA^'^-^Was, in) + u^s, (^3) < E ^'"'^^ + 1- 

i=l n i=l 

Here we have used the facts that Rj > Rj-i + 1, and that the cluster An H 18(0, Ri) is frozen 
for n > maxsize(i?i). By Theorem 7.1 > 1 are i.i.d. with P{Cj) < 00. Thus (b) follows by 
comparison with the interval recurrent Markov chain — ^17=1 X^~^^i- ^ 
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